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435/040.500 

435/029.000; 435/283.100 



L2 
AN 
TI 
IN 



PI 
AI 
RLI 
PRAI 



ANSWER 4 OF 104 US PAT FULL on STN 
2004 : 126898 US PAT FULL 

Novel proteins and nucleic acids encoding same 
Taupier, Raymond J., JR., East Haven, CT, UNITED STATES 
Padigaru, Mural idhara, Branford, CT, UNITED STATES 
Rastelli, Luca, Guilford, CT, UNITED STATES 
Spaderna, Steven Kurt, Berlin, CT, UNITED STATES 
Shimkets, Richard A., West Haven, CT, UNITED STATES 
zerhusen, Bryan D., Branford, CT, UNITED STATES 
spytek, Kimberly Ann, New Haven, CT, UNITED STATES 
shenoy, suresh G. , Branford, CT, UNITED STATES 
Li, Li, Cheshire, CT, UNITED STATES 
Gusev, Vladimir Y. , Madison, CT, united STATES 

, Branford, CT, UNITED STATES 
, II, Madison, CT, UNITED STATES 
Branford, CT, UNITED STATES 
E., Wethersfield, CT, UNITED STATES 
Branford, CT, UNITED STATES 
Ellerman, Karen, Branford, CT, UNITED STATES 
MacDougall , John R., Hamden, CT, UNITED STATES 
Stone, David J., Guilford, CT, UNITED STATES 
smithson, Glennda, Guilford, CT, UNITED STATES 
US 2004096877 Al 20040520 

US 2003-624932 Al 20030721 (10) 

continuation of Ser. No. US 2001-918779, filed on 30 Jul 2001, ABANDONED 



Grosse, William M. , 
Alsobrook, John P. , 
Lepley , Denise M. , 
Burgess , Catherine 
GerTach, Valerie L. 



US 2000-221409P 
US 2000-222840P 
US 2000-223752P 
US 2000-223762P 
US 2000-223770P 
US 2000-223769P 
US 2000-225146P 
US 2000-225392P 
US 2000-225470P 
US 2000-225697P 
US 2001-263662P 
US 2001-281645P 
DT Utility 
FS APPLICATION 
LN.CNT 11006 
INCL 



20000728 (60) 
20000804 (60) 
20000808 (60) 
20000808 (60) 
20000808 (60) 
20000808 (60) 

20000814 (60) 

20000815 (60) 

20000815 (60) 

20000816 (60) 
20010201 (60) 
20010405 (60) 



NCL 



IC 



INCLM: 435/006.000 

INCLS: 435/069.100; 435/320.100; 435/325.000; 530/350.000; 530/388.100; 

536/023.500 
NCLM: 435/006.000 

NCLS: 435/069.100; 435/320.100; 435/325.000; 530/350.000; 530/388.100; 
536/023.500 



[7] 
ICM: 
ICS: 



C12Q001-68 

C07H021-04; C07K014-47; C07K016-18 



CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 ANSWER 5 OF 104 US PAT FULL on STN 

AN 2004:94708 US PAT FULL 

TI Molecular toxicology modeling 

IN Mendrick, Donna, Gaithersburg, MD, UNITED STATES 
Porter, Mark, Gaithersburg, MD, UNITED STATES 
Johnson, Kory, Gaithersburg, MD, united STATES 
Higgs, Brandon, Gaithersburg, MD, UNITED STATES 
Castle, Arthur, Gaithersburg, MD, UNITED STATES 
Elashoff, Michael, Gaithersburg, MD, UNITED STATES 

PI US 2004072160 Al 20040415 

AI US 2002-152319 Al 20020522 (10) 

PRAI US 2001-292335P 20010522 (60) 
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US 


2002- 


372794P 


20020417 


(60^ 


Utility 









FS APPLICATION 
LN.CNT 27909 

INCL INCLM: 435/006.000 

INCLS: 435/091.200; 436/084.000 
NCL NCLM: 435/006.000 

NCLS: 435/091.200; 436/084.000 
IC [7] 

ICM: C12Q001-68 

ICS: C12P019-34; G01N033-20 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 ANSWER 6 OF 104 USPATFULL on STN 
AN 2004:69579 USPATFULL 

TI Proteins and nucleic acids encoding same 

IN Kekuda, Ramesh, Danbury, CT, UNITED states 

Alsobrook, John P., II, Madison, CT, UNITED STATES 
Tchernev, Velizar T. , Branford, CT, UNITED STATES 
Liu, xiaohong, Branford, CT, united STATES 
spytek, Kimberly A., New Haven, CT, UNITED STATES 
Patturajan, Meera, Branford, CT, UNITED STATES 
Grosse, William M., Branford, CT, UNITED STATES 
Lepley, Denise M. , Branford, CT, UNITED STATES 
Burgess, Catherine E., Wethersfield, CT, UNITED STATES 
Vernet, Corine A.M., Branford, CT, UNITED STATES 
Li, Li, Branford, CT, UNITED STATES 
Gorman, Linda, Branford, CT, UNITED STATES 
Edinger, Shlomit R., New Haven, CT, UNITED STATES 
sciore, Paul, North Haven, CT, united STATES 
Ellerman, Karen, Branford, CT, UNITED STATES 
Malyankar, Uriel M., Branford, CT, UNITED STATES 
Rothenberg, Mark E., Clinton, CT, UNITED STATES 
stone, David J., Guilford, CT, UNITED STATES 
Boldog, Ferenc L. , North Haven, CT, UNITED STATES 
Guo, xiaojia (Sasha) , Branford, CT, UNITED states 
shenoy, Suresh G. , Branford, CT, UNITED STATES 
Anderson, David w., Branford, CT, UNITED states 
Padigaru, Mural idhara, Branford, CT, UNITED STATES 
Taupier, Raymond J . , JR., East Haven, CT, UNITED STATES 
Miller, Charles E., Guilford, CT, united STATES 
Eisen, Andrew, Rockville, MD, UNITED STATES 

PI US 2004052806 Al 20040318 

AI US 2002-37417 Al 20020104 (10) 

PRAI US 2001-260018P 20010105 (60) 
US 2001-260360P 20010108 (60) 
US 2001-272411P 20010228 (60) 
US 2001-272817P 20010302 (60) 
US 2001-291186P 20010515 (60) 
US 2001-303231P 20010705 (60) 
US 2001-305060P 20010712 (60) 
US 2001-318405P 20010910 (60) 
US 2001-318700P 20010912 (60) 

DT Utility 

FS APPLICATION 

LN.CNT 13212 



INCL INCLM: 424/185.100 

INCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 530/350.000; 
536/023.200 
NCL NCLM : 424/185.100 

NCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 530/350.000; 
536/023.200 

IC [7] 

ICM: C07H021-04 

ICS: C12N009-00; A61K039-00; C12P021-02; C12N005-06; C07K014-47 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 ANSWER 7 OF 104 US PAT FULL on STN 
AN 2004:63731 US PAT FULL 

TI Novel nucleic acids and secreted polypeptides 
IN Tang, Y. Tom, San Jose, CA, UNITED STATES 

Yang, Yonqhong, San Jose, CA, UNITED STATES 

Weng, Gezhi , Piedmont, CA, UNITED STATES 

Zhang, Jie, Campbell, CA, united states 

Ren, Feiyan, Cupertino, CA, UNITED STATES 

Xue, Aidong, Sunnyvale, CA, UNITED STATES 

Wang, Jian-Rui, Cupertino, CA, UNITED STATES 

Wehrman, Tom, Stanford, CA, UNITED STATES 

Ghosh, Malabika J . , Sunnyvale, CA, UNITED STATES 

Wang, Dunrui , Poway, CA, UNITED STATES 

Zhao, Qing A., San Jose, CA, UNITED STATES 

Wang, zhiwei , Sunnyvale, CA, united states 
PI US 2004048249 Al 20040311 

AI US 2002-112944 Al 20020328 (10) 

RLI Continuation-in-part of Ser. No. US 2000-488725, filed on 21 Jan 2000, 

PENDING Continuation-in-part of ser. No. US 2000-491404, filed on 25 Jan 
2000, ABANDONED Continuation-in-part of Ser. No. US 2000-496914, filed 
on 3 Feb 2000, ABANDONED Continuation-in-part of Ser. No. US 
2000-515126, filed on 28 Feb 2000, ABANDONED Continuation-in-part of 
Ser. No. US 2000-519705, filed on 7 Mar 2000, ABANDONED 
continuation-in-part of Ser. No. us 2000-540217, filed on 31 Mar 2000, 
ABANDONED Conti nuati on-i n-part of Ser. No. us 2000-552929, filed on 18 
Apr 2000, ABANDONED Conti nuation-i n-part of Ser. No. US 2000-577408, 
filed on 18 May 2000, ABANDONED 
US 2001-306971P 20010721 (60) 
Utility 
APPLICATION 
LN.CNT 2 3809 

INCL INCLM : 435/006.000 

INCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 435/455.000; 

530/350.000; 536/023.200 
NCLM: 435/006.000 

NCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 435/455.000; 
530/350.000; 536/023.200 



PRAI 

DT 

FS 



NCL 



IC 



[7] 
ICM: 
ics: 



C12Q001-68 

C07H021-04; C12N009-00; C12P021-02; C12N005-06; C07K014-47; 
C12N015-85 

CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 ANSWER 8 OF 104 USPATFULL on STN 
AN 2004:63727 USPATFULL 

TI Novel human proteins, polynucleotides encoding them and methods of using 
the same 

IN Shimkets, Richard A., West Haven, CT, UNITED STATES 

Taupier, Raymond J., jr., East Haven, CT, UNITED STATES 
Burgess, Catherine E., Wethersfield, CT, UNITED STATES 
Zerhusen, Bryan D., Branford, CT, UNITED STATES 
Mezes, Peter s., old Lyme, CT, UNITED STATES 
Rastelli, Luca, Guilford, CT, UNITED states 
Malyankar, Uriel M. , Branford, CT, united STATES 
Grosse, William M . , Branford, CT, UNITED STATES 
Alsobrook, John P., II, Madison, CT, UNITED STATES 
Lepley, Denise M. , Branford, CT, UNITED STATES 
spytek, Kimberly Ann, New Haven, CT, UNITED STATES 
Li, Li, Cheshire, CT, UNITED STATES 
Edinger, Shi omit, New Haven, CT, UNITED STATES 
Gerlach, Valerie, Branford, CT, UNITED STATES 
Ellerman, Karen, Branford, CT, UNITED STATES 
MacDougall , John R., Hamden, CT, UNITED STATES 
Gunther, Erik, UNITED STATES 

Millet, Isabel le, Mil ford, CT, UNITED STATES 



Stone, David J., Guilford, CT, UNITED STATES 
smithson, Glennda, Guilford, CT, UNITED STATES 
Szekeres, Edward S., JR., Branford, CT, UNITED STATES 



PI 
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■238383P 


20001006 


(60) 
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-238382P 


20001006 
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2001- 


-275892P 


20010314 


(60) 




us 


2001- 


-296860P 
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(60) 


DT 


utility 







FS APPLICATION 
LN.CNT 8458 

INCL INCLM : 435/006.000 

INCLS: 435/069.100; 435/325.000; 435/320.100; 530/388.260; 536/023.200; 
435/183.000 
NCL NCLM : 435/006.000 

NCLS: 435/069.100; 435/325.000; 435/320.100; 530/388.260; 536/023.200; 
435/183.000 

IC [7] 

ICM: C12Q001-68 

ICS: C07H021-04; C12N009-00; C07K016-40; C12P021-02; C12N005-06 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

L2 ANSWER 9 OF 104 US PAT FULL on STN 

AN 2004:58174 US PAT FULL 

TI Novel nucleic acids and polypeptides 

IN Tang, Y. Tom, San Jose, CA, UNITED STATES 

Liu, Chenghua, San Jose, CA, UNITED STATES 

Asundi, Vinod, Foster city, CA, UNITED STATES 

Wehrman, Tom, Stanford, CA, UNITED STATES 

Ren, Feiyan, Cupertino, CA, UNITED STATES 

Zhou, Ping, Cupertino, CA, UNITED STATES 

Zhao, Qing A., San Jose, CA, UNITED states 

Drmanac, Radoje T., Palo Alto, CA, UNITED STATES 

Zhang, Jie, Campbell, CA, UNITED STATES 

Xue, Aidong, Sunnyvale, CA, UNITED states 

Wang, Jian-Rui , Cupertino, CA, UNITED STATES 

Wang, Dunrui , Poway, CA, UNITED STATES 
PI US 2004044181 Al 20040304 

AI US 2003-363616 Al 20030715 (10) 

WO 2001-US27093 20010831 
DT Utility 
FS APPLICATION 
LN.CNT 17667 

INCL INCLM : 530/350.000 

INCLS: 435/069.100; 435/320.100; 435/325.000; 536/023.500 
NCL NCLM: 530/350.000 

NCLS: 435/069.100; 435/320.100; 435/325.000; 536/023.500 
IC [7] 

ICM: C07K014-705 

ICS: C12P021-02; C12N005-06; C07H021-04 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

L2 ANSWER 10 OF 104 USPATFULL on STN 
AN 2004:44503 USPATFULL 

TI Methods of diagnosis of angiogenesis , compositions and methods of 

screening for angiogenesis modulators 
IN Murray, Richard, Cupertino, CA, UNITED STATES 

Glynne, Richard, Palo Alto, CA, UNITED STATES 

Watson, Susan R., El cerrito, ca, UNITED states 

Aziz, Natasha, Palo Alto, CA, united states 
PA Eos Biotechnology, Inc., South San Francisco, CA, united states, 94080 

(U.S. corporation) 
PI US 2004033495 Al 20040219 

AI US 2002-211462 Al 20020801 (10) 
PRAI US 2001-310025P 20010803 (60) 



US 2001-334244P 20011129 (60) 
DT Utility 
FS APPLICATION 
LN.CNT 24599 

INCL INCLM: 435/006.000 

INCLS: 435/007.230; 435/069.100; 435/320.100; 435/325.000; 536/023.200 
NCL NCLM : 435/006.000 

NCLS: 435/007.230; 435/069.100; 435/320.100; 435/325.000; 536/023.200 
IC [7] 

ICM: C12Q001-68 

ICS: G01N033-574; C07H021-04; C12P021-02; C12N005-06 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 
AN 
TI 
IN 



PI 
AI 
RLI 
PRAI 



DT 
FS 



ANSWER 11 OF 104 US PAT FULL on STN 
2004:38683 US PAT FULL 

Proteins and nucleic acids encoding same 
Edinger, shlomit R., New Haven, CT, united states 
MacDougall , John R., Hamden, CT, UNITED STATES 
Millet, Isabel le, Mil ford, CT, united states 
Ellerman, Karen, Branford, CT, UNITED STATES 
Stone, David J . , Guilford, CT, UNITED STATES 
Gerlach, Valerie, Branford, CT, UNITED STATES 
Grosse, William M. , Branford, CT, UNITED STATES 
Alsobrook, John P., II, Madison, CT, UNITED STATES 
Lepley, Denise M., Branford, CT, UNITED STATES 
Rieger, Danier K. , Branford, CT, united STATES 
Burgess, Catherine E., Wethersfield, CT, UNITED STATES 
Casman, stacie J., North Haven, CT, UNITED STATES 
Spytek, Kimberly A., New Haven, CT, UNITED STATES 
Boldog, Ference L., North Haven, CT, UNITED STATES 
Li, Li , Branford, CT, UNITED STATES 
Padigaru, Mural idhara, Branford, CT, UNITED STATES 
Mishra, Vishnu, Gainesville, FL, UNITED STATES 
Patturajan, Meera, Branford, CT, UNITED STATES 
shenoy, suresh G. , Branford, CT, UNITED STATES 
Rastelli, Luca, Guilford, CT, UNITED STATES 
Tchernev, Velizar T. , Branford, CT, united STATES 
Vernet, Corine A.M., Branford, CT, UNITED STATES 
Zerhusen, Bryan D., Branford, CT, UNITED STATES 
Malyankar, Uriel M., Branford, CT, UNITED STATES 
Guo, Xiaoiia (Sasha) , Branford, CT, UNITED STATES 
Miller, Charles E., Guilford, CT, UNITED STATES 
Gangolli, Esha A. , Madison, CT, UNITED STATES 
Grosse, Michael, UNITED STATES LR 
US 2004029222 Al 20040212 

US 2002-218779 Al 20020814 (10) 

Continuation of Ser. No. US 2001-995514, filed on 28 Nov 2001, ABANDONED 



US 2000-253834P 
US 2OO0-25O926P 
US 2001-264180P 
US 2001-313656P 
US 2001-327456P 
Utility 
APPLICATION 



20001129 
20001130 
20010125 
20010820 
20011005 



(60) 
(60) 
(60) 
(60) 
(60) 



LN.CNT 15385 



INCL 



NCL 



IC 



INCLM: 
INCLS: 

NCLM: 
NCLS: 



[7] 
ICM: 
ICS: 



435/069.100 

435/183.000; 

530/388.100; 

435/069.100 

435/183.000; 

530/388.100; 



435/320.100; 435/325.000; 530/350.000; 536/023.200; 

435/007.230; 435/006.000 

435/320.100; 435/325.000; 530/350.000; 536/023.200; 

435/007.230; 435/006.000 



C12Q001-68 

G01N033-574; C07H021-04; C12N009-00; C12P021-02; C12N005-06; 
C07K014-47; C07K016-30 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 ANSWER 12 OF 104 USPATFULL on STN 
AN 2004:38577 USPATFULL 

TI Proteins and nucleic acids encoding same 
IN Edinger, shlomit R., New Haven, CT, UNITED STATES 
MacDougall, John R. , Hamden, CT, UNITED STATES 
Millet, Isabel! e, Mil ford, CT, UNITED STATES 
Ellerman, Karen, Branford, CT, UNITED STATES 
stone, David J., Guilford, CT, UNITED STATES 
Gerlach, Valerie, Branford, CT, UNITED STATES 



Grosse, William M . , Branford, CT, UNITED STATES 

Alsobrook, John P., II, Madison, CT, UNITED STATES 

Lepley, Denise M., Branford, CT, UNITED STATES 

Rieger, Daniel K. , Branford, CT, UNITED STATES 

Burgess, Catherine E., wethersfield, CT, UNITED STATES 

casman, Stacie J., North Haven, CT, UNITED STATES 

Spytek, Kimberly A., New Haven, CT, UNITED STATES 

Boldog, Ferenc L., North Haven, CT, UNITED STATES 

Li, Li, Branford, CT, UNITED STATES 

Padigaru, Mural idhara, Branford, CT, UNITED STATES 

Mishra, Vishnu, Gainesville, FL, UNITED STATES 

Patturajan, Meera, Branford, CT, UNITED STATES 

shenoy, suresh G., Branford, CT, UNITED STATES 

Rastelli, Luca, Guilford, CT, UNITED STATES 

Tchernev, Velizar T., Branford, CT, UNITED STATES 

Vernet, Corine A.M., Branford, CT, UNITED STATES 

Zerhusen, Bryan D., Branford, CT, UNITED STATES 

Malyankar, Uriel M., Branford, CT, UNITED STATES 

Guo, Xiaojia, Branford, CT, UNITED STATES 

Miller, Charles E., Guilford, CT, UNITED STATES 

Gangolli, EshaA., Madison, CT, UNITED STATES 
PI US 2004029116 Al 20040212 

AI US 2002-87684 Al 20020301 (10) 

PRAI US 2001-313656P 20010820 (60) 

US 2001-274194P 20010308 (60) 

US 2001-327456P 20011005 (60) 
DT Utility 
FS APPLICATION 
LN.CNT 15489 

INCL INCLM : 435/006.000 

INCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 530/350.000; 
536/023.200 
NCL NCLM: 435/006.000 

NCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000; 530/350.000; 
536/023.200 

IC [7] 

icm: C12Q001-68 

ICS: C07H021-04; C12N009-00; C12P021-02; C12N005-06; C07K014-47 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 

L2 ANSWER 13 OF 104 US PAT FULL on STN 
AN 2004:31106 US PAT FULL 

Tl Receptors 

IN Griffin, Jennifer A, Fremont, CA, UNITED STATES 

Kallick, Deborah A, Galveston, TX, UNITED STATES 

Tribouley, Catherine M, San Francisco, CA, UNITED STATES 

Yue, Henry, Sunnyvale, CA, UNITED states 

Nguyen, Danniel B, San Jose, CA, UNITED STATES 

Tang, Y Tom, San Jose, CA, united STATES 

Lai, Preeti G, Santa Clara, CA, UNITED STATES 

Policky, Jennifer L., San Jose, CA, UNITED STATES 

Azimzai, Yalda, Oakland, CA, UNITED STATES 

Lu, Dyung Aina M, San Jose, CA, UNITED STATES 

Graul , Richard c, san Francisco, CA, UNITED STATES 

Yao, Monique G, Carmel , IN, UNITED STATES 

Burford, Neil, Durham, CT, UNITED STATES 

Hafalia, April J A, Daly city, CA, united STATES 

Baughn, Mariah R, San Leandro, CA, UNITED STATES 

Bandman, olga, Mountain View, CA, UNITED STATES 

Arvizu, Chandra s, San Jose, CA, UNITED STATES 

xu, Yuming, Mountain View, CA, UNITED STATES 

Gandhi, Ameena R, San Francisco, CA, UNITED STATES 

Warren, Bridget A, Encinitas, CA, UNITED STATES 

Ding, Li, Creve Coeur, MO, UNITED STATES 

sanjanwala, Madhusudan M, Los Altos, CA, UNITED STATES 

Duggan, Brendan M, Sunnyvale, CA, UNITED STATES 

Lu, Yan, Mountain view, CA, UNITED STATES 

Yang, Junming, san Jose, CA, UNITED STATES 
PI US 2004023244 Al 20040205 

AI US 2003-311623 Al 20030516 (10) 

WO 2001-US19942 20010621 
DT Utility 
FS APPLICATION 
LN.CNT 8061 

INCL INCLM: 435/006.000 

INCLS: 435/069.100; 435/183.000; 435/320.100; 435/325.000 



435/006.000 

435/069.100; 435/183.000; 435/320.100; 435/325.000 



NCL NCLM: 

NCLS: 
IC [7] 

ICM: C12Q001-68 

ICS: C12N009-00; C12P021-02; C12N005-06 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L2 
AN 
TI 

IN 



PI 
AI 
RLI 



PRAI 



DT 
FS 



ANSWER 14 OF 104 US PAT FULL on STN 
2004:18871 US PAT FULL 

Novel polynucleotides, polypeptides encoded thereby and methods of use 
thereof 

Anderson, David W. , Plantsville, CT, UNITED STATES 
Boldog, Ferenc L., North Haven, CT, UNITED STATES 
casman, stacie J., North Haven, CT, UNITED STATES 
Edinger, shlomit R. , New Haven, CT, UNITED STATES 
Ellerman, Karen, Branford, CT, UNITED states 
Fernandes, Elma R., Branford, CT, UNITED states 
Gunther, Erik, Branford, CT, UNITED states 
Leach, Martin D. , Madison, CT, UNITED STATES 
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CSS Supported By: NATIONAL INSTITUTE OF NEUROLOGICAL DISORDERS AND STROKE 

DB 2003 (/09/00) 

FYR 2003 

DE 2002 C/29/04) 

FU Noncompeting Continuation (Type 5) 

FS National institutes of Health 
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AN 2004:176660 FEDRIP 

NR CRISP 5R01GM40613-13 

TI Molecular Genetics of Drosophila Neural Development 

SF Principal Investigator: THOMAS, JOHN B; JTHOMAS@SALK.EDU, salk INST FOR 

BIOLOGICAL STUDIES, PO BOX 85800 

CSP SALK INSTITUTE FOR BIOLOGICAL STUDIES, SAN DIEGO, CALIFORNIA 

css supported By: national institute of general medical sciences 

DB 2004 C/01/90) 

FYR 2003 

DE 2003 (/31/06) 

FU Noncompeting Continuation (Type 5) 

FS National Institutes of Health 
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Locus (LOC): AY510109 GenBank (R) 

GenBank ACC. No. (gbn) : AY510109 

GenBank VERSION (VER) : AY510109.1 GI:46095340 

CAS REGISTRY NO. (rn) : 676382-19-7 



SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 

source: 

ORGANISM (ORGN): 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO) 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO): 



1557 

mRNA; linear 
Primates 
19 Apr 2004 

Homo sapiens ZU5 and death domain-containing inhibitor 
of NF-kB mRNA, complete cds. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 

1 (bases 1 to 1557) 

Zhang, J.; xu,L.G.; Han,K.J.; Shu.H.B. 
Identification of a ZU5 and death domain-containing 
inhibitor of NF-kappaB 

J. Biol, chem., 279 (17), 17819-17825 (2004) 

2 (bases 1 to 1557) 

Zhang, J.; Xu,L.-G.; Han,K.-J.; Shu,H.-B. 
Direct Submission 

Submitted (21-DEC-2003) immunology, National Jewish 
Center, 1400 Jackson Street, Denver, CO 80206, USA 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



CDS 



1..1557 



1..1557 



misc-feature 322 . .558 



misc-feature 1267. .1473 



/orqanism="Homo sapiens" 

/mol-type= ,, mRNA M 

/db-xref="taxon:9606" 

/note= M ZUD" 

/codon-start=l 

/product="ZU5 and death 

domain-containing inhibitor of 

NF-kB" 

/protei n-i d="AAS80165 . 1" 

/db-xref="GI: 46095341" 

/transl ation="MCPQESSFQPSQFLLLVGVP 

VAS V L L LAQC L RWHC P R R L LGACW 

TLNGQEEPVSQPTPQLENEVSRQHLPATLPEMVA 

F YQ E L HT PTQGQTM V RQ LM H K L L V 

FS AR EVDH R GGC L M LQDTG I S L L I P PGAVAVG RQ 

ERVSLILVWDLSDAPSLSQAQGLV 

S PWACGPHGAS FLKPCTLTFKHCAEQPSHARTY 

S S NTT L LDAKVWR P LG R PGAH AS R 

DECRIHLSHFSLYTCVLEAPVGREARKWLQLAVF 

CSPLVPGQSHLQLRIYFLNNTPCA 

LQWALTNEQPHGGRLRGPCQLFDFNGARGDQCLK 

LTYIS EGWENVDDSSCQLVPH LHI 

WHGKCPFRSFCFRRKAADENEDCSALTNEIIVTM 

HTFQDGLETKYMEILRFQASEEES 

WAAPPPVSQPPPCNRLPPELFEQLRMLLEPNSIT 

GNDWRGLASHLGLCGMKIRFLSCQ 

RS PAAAILELFEEQNGS LQELHYLMTVMERLDCA 

SAIQN YLSGTHGGS PGPERGGARD 

NQGLELDEKL" 

/note="present in ZO-1 and 
Unc5-like netrin receptors; 
Region: ZU5 domain" 
/note="Region: death domain" 



SEQUENCE (SEQ) : 

1 atgtgccccc 
61 gtggcaagtg 
121 ggggcctgct 
181 gaaaatgagg 
241 caggagctac 
301 ttggtgtttt 
361 ggcatctcct 
421 ttgatcctgg 
481 agccctgtgg 
541 ttcaaacact 
601 ctggatgcca 
661 tgtcgcatcc 
721 cgcgaagccc 
781 tcccatctgc 
841 ctgaccaacg 
901 aatggggcta 



aggagagttc 
tcctccttct 
ggacactgaa 
tctcaaggca 
acacacccac 
cggctcgaga 
tgctcatccc 
tgtgggacct 
tggcatgtgg 
gtgccgagca 
aggtatggag 
acctctccca 
gcaaatggct 
aactgcgtat 
agcagcccca 
ggggcgacca 



attccaaccc 
ggcccaatgc 
tggtcaagag 
gcacctgcca 
tcaaggccag 
ggtggatcac 
accaggtgct 
gtcggacgcc 
cccccatggg 
gcccagccat 
gcccctgggg 
cttcagcctc 
gcagctggcc 
ctacttcctc 
tggtgggcgc 
gtgcctgaag 



tcccagttcc 
cttcgatggc 
gaaccagtgt 
gccacactgc 
accatggtcc 
cgcggcggtt 
gtggctgtgg 
ccatcgctgt 
gcctccttcc 
gctcgcacct 
cggccggggg 
tacacctgtg 
gtattctgct 
aacaacacgc 
ctgcgtgggc 
ctcacgtaca 



tactgctggt 
actgccctag 
cccagcctac 
cagagatggt 
gccagttgat 
gcctgatgct 
gccgccagga 
cccaagccca 
tgaagccttg 
acagcagcaa 
cccacgcctc 
tgctggaggc 
caccgctggt 
cctgcgccct 
cctgccagct 
tctcagaggg 



gggggtccca 
aaggctgctg 
cccccaacta 
tgccttctac 
gcacaaactg 
ccaggataca 
gcgggtgtct 

ggggctggta 

cactctcacg 
cactaccctg 
ccgggatgag 
acctgtgggg 
gccaggacag 
gcagtgggca 
cttcgacttc 
ttgggagaat 



961 gtggatgaca 
1021 ttccgctcct 
1081 accaatgaga 
1141 gaaatcctca 
1201 cagccgcccc 
1261 gagccaaaca 
1321 ggcatgaaga 
1381 tttgaggagc 
1441 ctagactgcg 
1501 cccgagcgcg 



gcagttgcca gctggttccc 
tctgcttccg gagaaaagca 
tcattgtcac catgcacacc 
gattccaggc atcagaggag 
catgcaatag gctgccccca 
gcatcaccgg caatgactgg 
tccggttcct gtcctgccag 
agaacggcag cctgcaggag 
cctccgccat ccagaactac 
ggggcgcccg ggataaccag 



catctccaca tctggcatgg 
gccgatgaga atgaggactg 
ttccaggatg gcttggagac 
gaatcctggg cagcgccacc 
gagctctttg agcagctgcg 
cgcggactgg cctcccacct 
cgcagccccg cagcggccat 
ctgcactacc tcatgaccgt 
ctgagtggga cacacggcgg 
ggcctggagc tggacgagaa 



aaagtgcccc 
ttcagcacta 
caagtatatg 
acctgtttcc 
gatgttattg 
ggggctttgc 
cctggagttg 
catggagcgg 
cagcccaggc 
gctctga 
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LOCUS (LOC): 
GenBank ACC. NO. (GBN) : 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF) : 

SOURCE: 
ORGANISM (ORGN): 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 

JOURNAL (SO) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) 



GenBank (R) 



AY510108 
AY510108 
AY510108.1 Gl: 46095338 
676382-17-5 
1557 

mRNA; linear 

Rodents 

19 Apr 2004 

Mus musculus ZU5 and death domain-containing inhibitor 
of NF-kB mRNA, complete cds. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
sciurognathi ; Muridae; Murinae; Mus 

1 (bases 1 to 1557) 

Zhang, J.; Xu,L.G.; Han,K.J.; shu,H.B. 
Identification of a ZU5 and death domain-containing 
inhibitor of NF-kappaB 

J. Biol. Chem., 279 (17), 17819-17825 (2004) 

2 (bases 1 to 1557) 

Zhang, J.; xu,L.-G.; Han,K.-J.; shu,H.-B. 
Direct submission 

submitted (21-DEC-2003) Immunology, National Jewish 
Center, 1400 Jackson street, Denver, CO 80206, USA 



FEATURES (FEAT) : 
Feature Key 



Location 



Qualifier 



source 



CDS 



1. .1557 



1. .1557 



misc-feature 322. .558 



misc-feature 1264. . 1473 



/organism="Mus musculus 1 ' 

/mol-type="mRNA" 

/db-xref="taxon : 10090" 

/chromosome= ,, 17 M 

/note="ZUD" 

/codon-start=l 

/product="ZU5 and death 

domain-containing inhibitor of 

NF-kB" 

/protei n-i d="AAS80164 . 1" 

/db-xref="GI:46095339" 

/translation="MSPQESSVQPSQFLLLVGiP 

VASALLLAQCLRWHCCQWLPGTCR 

KLDDPEEPVSPSTPLPEYELPRQCPAPTLPEVAA 

FYQE L HM PTQGQT VT RQMM H K L LV 

FSAREVDH RGGCLI LQDTGIS L LI PPGAVTVGRQ 

ERVSLVLVWDLTDAPSLSHRQGLV 

SPWACGPHGASFLKPCTLTFKHCAQQPSQACAY 

S S NTS L LDAKDWK P LGQ PGTH I S R 

DECRI LLSH FS LYTCVLEAPLGQTARKWLQLAMF 

CSPLVPGQTHLQLRVYFLNNTPCA 

LQWAITNEQPHGGRMRGPCQLFDFTGARADQCLK 

LKYISEGWENVDDSSSQLVPHLHI 

WHGKCPFRSFCFRRKAANGNEECSALTNEIIVTM 

HT FQDG L ETK YVE I L R FQAS E E ET 

WAVPPPVSQPPLCNRLPPELFEQLQMLLEPNSVT 

GNDWRRLASHLGLCGMKIRVLSCQ 

RSPAAAILELFEEQNGSLQELHYLMTSMERLDCA 

SAIQNYLNRSPRGSPDRLHGGTWE 

NHGLELDEKL" 

/note="present in zo-1 and 
Unc5-like netrin receptors; 
Region: ZU5 domain" 
/note= M Region: death domain" 



SEQUENCE (SEQ): 

1 atgagccccc aggagagctc tgttcaacct tcccagttcc tcctgttggt ggggatccct 

61 gtggccagtg ctctccttct ggctcagtgt cttcgatggc actgctgtca gtggctgcca 

121 gggacctgcc ggaagctgga tgatccagag gagccagtgt ccccatccac tccattacca 

181 gagtatgagc tcccgaggca gtgcccagcc cccacactac cggaggtggc tgcgttctac 

241 caggaactcc acatgcctac ccagggccag actgtcaccc gccagatgat gcataagcta 

301 ctggtgtttt ctgctcgaga ggtggatcac cgtggtggat gcctgatcct gcaggacaca 

361 ggcatctccc tgctcatccc gccaggtgct gtgaccgtgg gtcgccagga gagggtgtcc 

421 ttggttctgg tgtgggacct gacagatgcc ccgtcactgt ctcacagaca gggactagtg 

481 agtcctgtgg tggcctgtgg ccctcacggg gcctctttcc tgaagccttg caccctcacg 

541 ttcaagcact gcgcccaaca gcccagtcaa gcatgtgcct acagcagcaa tacctccttg 

601 ctggatgcca aggactggaa acccctgggt cagccgggga ctcatatctc cagggacgag 

661 tgtcgtatcc tgctctctca cttcagtctc tacacctgcg ttctggaggc ccccctgggc 

721 cagacagccc gcaagtggct gcagctggcc atgttctgct ccccactggt gccagggcag 

781 acacacctgc agctgcgcgt ctacttcctg aacaacactc cctgcgccct gcagtgggct 

841 atcaccaatg aacagccgca cgggggacgc atgcgcgggc cgtgccagct cttcgacttc 

901 actggggcca gagcagacca gtgcctgaag ctcaagtaca tctctgaggg ttgggagaat 

961 gtggatgata gcagtagcca gctggttcca catctccaca tctggcatgg aaagtgcccc 

1021 ttccgttctt tctgcttccg gagaaaggca gccaatggga atgaagagtg ctcggcatta 

1081 accaatgaga tcattgtcac catgcacacc ttccaggatg gcttggaaac caaatacgtt 

1141 gaaatcctca gattccaggc atcggaggaa gagacctggg cagtgccccc tcctgtctcc 

1201 cagccacccc tgtgcaacag gctgccccca gagctctttg agcagctgca gatgttgttg 

1261 gagcccaaca gtgtcactgg gaatgactgg cgcagactgg cctcccacct gggtctctgt 

1321 ggcatgaaaa tccgggtctt gtcctgccag cgcagtcccg ccgcggccat tctggaattg 

1381 ttcgaggaac agaatggcag cttgcaggag ctgcactatc tcatgacctc catggagcgg 

1441 ctggactgcg cctctgccat ccagaactac ctaaaccggt ctccccgggg cagccctgac 

1501 aggttgcatg gaggcacctg ggagaaccat ggcctagagc tggatgagaa actctaa 
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LOCUS (LOC): BC057560 GenBank (R) 

GenBank ACC. NO. (GBN) : BC057560 

GenBank VERSION (VER) : BC057560.1 GI: 34785820 

CAS REGISTRY NO. (RN) : 588653-90-1 

SEQUENCE LENGTH (SQL) : 3672 

MOLECULE TYPE (ci) : mRNA; linear 

division CODE (ci): Rodents 

DATE (DATE): 21 Oct 2003 

DEFINITION (DEF) : Mus musculus unc-5 homolog B (C. eleqans), mRNA (cDNA 

clone MGC: 66787 IMAGE : 6417563) , complete cds. 

KEYWORDS (ST): MGC 

SOURCE: Mus musculus (house mouse) 

ORGANISM (ORGN): MUS musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
NUCLEIC ACID COUNT (NA) : 783 a 1137 c 1074 g 678 t 
COMMENT : 

Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, university of Iowa 

cDNA Library Preparation: M. Bento Soares, University of Iowa 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA sequencing by: sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgcstanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M., Schmutz, J., Grimwood, 3 . , Rodriquez, A., and Myers, 

R. M. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 125 Row: o Column: 12 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 21218439. 
REFERENCE: 1 (bases 1 to 3672) 

AUTHOR (AU): Strausberg,R.L. ; Feingold,E.A. ; Grouse, L.H.; 

Derge,J.G.; Klausner , R.D. ; Collins, F. s. ; Wagner, L.; 

Shenmen,C.M. ; Schuler ,G.D. ; Al tschul ,S . F. ; Zeeberg,B.; 

Buetow,K.H.; Schaefer ,C.F. ; Bhat,N.K.; Hopkins, R.F. ; 

Jordan, H.; Moore.T.; Max,S.I.; Wang, J.; Hsieh,F.; 

Diatchenko,L. ; Marusina,K. ; Farmer , A. A. ; Rubin, G.M. ; 

Hong,L.; stapleton,M. ; soares, M.B.; Bonal do ,M. F. ; 

Casavant,T.L. ; Scheetz,T.E. ; Brownstei n , M . J . ; 

usdin,T.B.; Toshiyuki ,S. ; Carm'nci,p.; Prange,c.; 

Raha.S.S.; Loquellano, N.A. ; Peters, G. J.; Abramson,R.D. ; 



TITLE (TI): 

JOURNAL (SO): 

OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



Munahy f S.J.; Bosak,S.A.; McEwan.P.J.; McKernan , K . J . ; 
Malek,J.A.; Gunaratne.P.H. ; Richards, S.; Worley.K.C; 
Hale.s.; Garcia, A.M.; Gay,L.J.; Hulyk,S.W.; 
villalon,D.K. ; Muzny,D.M. ; Sodergren , E . J . ; Lu,x.; 
Gibbs,R.A.; Fahey,J.; Helton, E. ; Ketteman,M. ; Madan,A. 
Rodrigues,S. ; Sanchez, A.; Whiting, M.; Madan,A.; 
Young, A. C; Shevchenko, Y. ; Bouffard,G.G. ; 
Blakesley, R.W. ; Touchman, J .W. ; Green, E.D.; 
Dickson, m. c. ; Rodriguez, A. C. ; Grimwood , J . ; Schmutz,J.; 
Myers.R.M.; Butterfield,Y.S. ; Krzywi nski , M . I . ; 
Skalska,u.; smailus,D.E. ; Schnerch,A.; Schein,J.E.; 
Jones, s. J.; Marra,M,A. 

Generation and initial analysis of more than 15,000 

full-length human and mouse CDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 

2 (bases 1 to 3672) 

Strausberq,R. 

Direct Submission 

Submitted (03-SEP-2003) National Institutes of Health, 
Mammalian Gene Collection (mgc) , Cancer Genomics 
Office, National Cancer Institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT): 

Feature Key Location 
======—=======+======—===== 

source 1..3672 



gene 



CDS 



1. .3672 



224. .3028 



Qualifier 



==+= 



/orqanism= n Mus musculus" 

/mol-type= ,, mRNA ,, 

/strain= M C57BL/6" 

/db-xref= n taxon : 10090" 

/cl one= M MGC : 66787 IMAGE : 6417563" 

/tissue-type="Brain, enriched 

mouse brain 12.5dpc" 

/cl one-1 i b="NIH-BMAP-Fl0" 

/lab-host="DHl0B" 

/note="vector: pYX-ASC" 

/gene= M unc5b" 

/db-xref="LocusID: 107449" 

/db-xref="MGI: 894703" 

/codon-start=l 

/product="Unc5b protein" 

/protei n-i d="AAH57560 . 1" 

/db«xref="Gl: 34785821" 

/db-xref= M LocusID : 107449" 

/transl ation="MRARSGVRSALLLALLLCWD 

PTPSLAGVDSAGQVLPDSYPSAPA 

EQLPYFLLEPQDAYIVKNKPVELHCRAFPATQIY 

FKCNGEWVSQNDHVTQES LDEATG 

LRVREVQIEVSRQQVEELFGLEDYWCQCVAWSSS 

GTTKSRRAYIRIAYLRKNFDQEPL 

AKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVI 

DPAQDTNFLLTIDHNLIIRQARLS 

DTAN YTCVAKN IVAKRRSTTATVIVYVNGGWSSW 

AEWS PCSN RCGRGWQKRTRTCTN P 

APLNGGAFCEGQAFQKTACTTVCPVDGAWTEWSK 

WSACSTECAHWRSRECMAPPPQNG 

GRDCSGTLLDSKNCTDGLCVLTLETSGDVALYAG 

LWAVFVWAVLMAVGVIVYRRNC 

RD FDTDITDSSAALTGGFH PVN FKTARPNN PQL L 

HPSAPPDLTASAGIYRGPVYALQD 

SADKIPMTNSPLLDPLPSLKIKVYNSSTIGSGSG 

LADGADLLGVLPPGTYPGDFSRDT 

HFLHLRSASLGSQHLLGLPRDPSSSVSGTFGCLG 

GR LS L PGTGVS L LVPNGAI PQGK F 

YDLYLHINKAESTLPLSEGSQTVLSPSVTCGPTG 

L L LCR P W LTV PH CAE VI AGDWI F 

QLKTQAHQGHWEEWTLDEETLNT PC YCQ LEAKS 

CH I L LDQLGTYVFMGES YS RSAVK 

R LQ L A I FA PA LCTS L E YS L R VYC L E DT PVA LK E V 

LELERTLGGYLVEEPKPLLFKDSY 

HN LRLS LHDI PHAHWRSKLLAKYQEI PFYHVWNG 

SQRALHCTFTLERHSLASTEFTCK 

VCVRQVEGEGQIFQLHTTLAETPAGSLDALCSAP 

GNAITTQLGPYAFKIPLSIRQKIC 



misc-feature 698. .955 

misc-feature 968. .1123 

misc-feature 1136. .1282 

misc-feature 1811. .2122 

misc-feature 2744. .3016 



SSLDAPNSRGNDWRLLAQKLSMDRYLNYFATKAS 
PTGVI LDLWEARQQDDGDLN S LAS 
ALEEMGKSEMLVAMATDGDC" 
/ note="iG ; Regi on : Immunogl obul in" 
/db-xref = n CDD : smart00409" 
/note= ,, TSPl; Region: 
Thrombospondin type 1 repeats" 
/db-xref="CDD:smart00209'' 
/note="TSPl; Region: 
Thrombospondin type 1 repeats" 
/db-xref="CDD:smart00209" 
/note="zu5; Region: Domain present 
in ZO-1 and Unc5-like netrin 
receptors" 

/db-xref= M CDD:smart00218 M 
/ note="DEATH ; Region: DEATH 
domain, found in proteins involved 
in cell death (apoptosis). 
Alpha-helical domain present in a 
variety of proteins with apoptotic 
functions. Some (but not all) of 
these domains form homotypic and 
heterotypic dimers" 
/db-xref="CDD:smart00005" 

SEQUENCE (SEQ): 

1 gctgagagag ccgggatccc cagcggcgtc cgactcccgg agcgctccta gtcgccgggc 
61 ggcctcccgg cgctgcgcgg ttgcctctgc gcctacggag ggcacgggct ggcgctgccg 
121 ggcgcctgcg agaacggcga ggcggcggcg aaggcgaagg cggcgaggct ggggaccggg 
181 aaagaacccc gagggagagg cgcccgggcc gggggacagg agcatgaggg cccggagcgg 
241 ggtgcggagc gcgctgctgc tggcgctgct gctttgctgg gatccgacac cgagcctagc 
301 aggcgttgac tctgctggcc aggtgctccc agactcctac ccatcagccc ctgcggagca 
361 gctgccgtac ttcctattgg agccacagga cgcctacatc gtaaagaaca agccagtgga 
421 actgcactgc agagccttcc ctgccacgca gatctacttc aagtgtaatg gcgagtgggt 
481 cagccagaat gaccacgtca cacaggagag cctggatgag gccacaggct tgcgggtgcg 
541 agaggtgcag atcgaggtgt cacggcagca agtggaggaa ctcttcgggc tcgaggacta 
601 ctggtgccag tgcgtggcct ggagctcttc gggaactacc aagagtcgcc gagcctacat 
661 ccgcattgcc tacttgcgca agaactttga ccaggagcct ctggccaagg aggtaccctt 
721 ggatcatgag gtccttctgc agtgccgccc accggaggga gtgcctgtgg ctgaggtgga 
781 atggctcaag aatgaagatg tcattgaccc cgctcaggac actaacttcc tgctcaccat 
841 tgaccacaac ctcatcatcc gccaggcgcg cctctcagac acggccaact acacctgtgt 
901 ggccaagaat atcgtggcca agcgccggag caccacggcc acagtcatcg tctatgtgaa 
961 tggaggctgg tccagctggg cagagtggtc accctgttcc aatcgctgtg gccgaggctg 
1021 gcagaagcgt actcggacct gcaccaatcc agccccactc aatggaggcg ccttctgtga 
1081 gggacaggcc ttccagaaga cagcttgcac caccgtgtgc ccagtggatg gagcgtggac 
1141 cgagtggagc aagtggtctg cctgcagcac agagtgtgcg cactggcgca gccgcgagtg 
1201 catggcaccg ccaccccaga acggaggccg tgactgcagc gggacgctac ttgactccaa 
1261 gaactgcact gatgggctgt gcgtgctgac cctggagaca tcgggagatg tggcactgta 
1321 cgcaggcctt gtggtggccg tctttgtggt ggtagcggtt ctcatggccg tgggagtgat 
1381 cgtataccgg agaaactgcc gggacttcga cacggacatc accgactcct ctgcggccct 
1441 cactggtggc ttccaccctg tcaacttcaa gactgcaagg cccaacaacc cgcagctcct 
1501 gcacccgtcc gcccctccag acctaacggc cagtgctggc atctaccgcg ggcctgtgta 
tgccctgcag gactccgccg acaagatccc catgactaat tcgcccctgc tggatcccct 
1621 gcccagcctc aagatcaagg tctataactc cagcaccatc ggttctgggt ctggcctggc 
1681 tgatggagcc gacctgctgg gtgtcctccc gccgggcacg tacccaggcg atttctcccg 
1741 ggacacccat ttcctgcacc tgcgcagtgc cagccttggt tcccagcacc tcctgggcct 
acct(= ggg ac cccagcagca gtgtcagcgg cacctttggt tgcctgggag gaaggctgag 
cctcccc gg c acaggggtca gcctgttggt accaaatgga gccattcccc agggcaagtt 
nnoT ctat g acct g tatctacata tcaacaaggc cgaaagcacc ctcccacttt cagaaggttc 
^ } ccagacagta ttgagcccct cggtgacctg tgggcccaca ggcctactcc tgtgccgccc 
2041 tgtcgtcctc accgtgcccc actgtgctga agtcatcgct ggagactgga tctttcagct 
2101 caagacccag gcccatcagg gccactggga ggaggtggtg accttggatg aggagaccct 
2161 caacacaccc tgctactgcc agctggaggc taagtcctgc cacatcctgc tggaccagct 
2221 gggtacctac gtattcatgg gcgagtccta ctctcgctct gcagtcaagc ggctccagct 
2281 ggccatcttc gccccagccc tctgcacctc cctggagtat agcctcaggg tctactgtct 
2341 ggaggacaca cctgtagcac tgaaggaggt cctggagctg gagaggactc tgggtggcta 
2401 cttggtggag gagcccaagc ctttgctctt taaggacagt taccacaacc tacgcctctc 
2461 cctccatgac atcccccatg cccactggag gagcaaacta ctggccaagt accaggagat 
2521 tcccttctac cacgtctgga atggcagcca gagagccctg cactgcactt tcaccctgga 
2581 gaggcatagc ctggcctcca cggagttcac ctgtaaggtc tgcgtgcggc aggtcgaagg 
2641 ggaaggccag attttccagc tgcacacaac gttggccgag acgcctgctg gctccctgga 
2701 tgctctctgc tctgccccgg gcaatgccat caccacccag ctgggaccct atgccttcaa 
2761 gatacccctg tccatccgcc aaaagatctg cagcagcctg gacgccccca actcccgggg 
2821 caacgactgg aggctgttgg cgcagaagct gtccatggac cggtacctaa actacttcgc 
2881 caccaaagct agtcccacag gtgtcatctt agacctctgg gaagctcggc aacaggatga 
2941 cggggacctc aacagcctgg ccagtgcctt ggaggagatg ggcaagagtg agatgctggt 



3001 agccatggcc acagatggcg attgctgagt gcctgtgacc acaggcctgt ggggatcagt 

3061 aggagacggt gcaaggaggc ctggcagcct ctgcacaggg gtgcccagcc tccaccactc 

3121 ctggctcaca gcaggaatgg tccttcaact ccctccccgc cacaaccctc agaccaccac 

3181 caccagcctt agaaagtctc tgtgctctac tgccaagagg ccgggatcct ctggcccact 

3241 gtttccccag ctcactctgg ggtgggctga ggcctctggg acagctgaaa gccagaggct 

3301 ttcccctgcg acaacacacc accctcagcc ctgtgacttt ggggacccac aggtttcaat 

3361 tctgtgttca catggtcctg ggctagggac cgctctctta tcccgggtcg agttcagttc 

3421 aggcaaactg ctttttcctg tccacaagca gagagggaag attaggggag tgggggtggg 

3481 gggtggggga tgagcctcag aagtcagcga gactcaggta gtgagagagc aaaaacagta 

3541 agggcaaaga aagacccagt ttcttaggga acgcaaatga tttattatcc agatacttgg 

3601 atagttcctt tttaagaaaa caaaacaaac aaaaaaaagt aaaaaaaaaa aaaaaaaaaa 
3661 aaaaaaaaaa aa 
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LOCUS (LOC): BC058084 GenBank (R) 

GenBank ACC. NO. (GBN) : BC058084 

GenBank VERSION (VER) : BC058084.1 GI: 34784158 

CAS REGISTRY NO. (RN) : 588638-70-4 

SEQUENCE LENGTH (SQL): 3844 

MOLECULE TYPE (ci) : mRNA; linear 

division code (ci): Rodents 

DATE (DATE): 21 Oct 2003 

DEFINITION (DEF): Mus musculus unc-5 homolog A (c. eleqans), mRNA (cdna 

clone MGC: 66671 IMAGE : 6813463) , complete cds. 
KEYWORDS (ST) : MGC 

SOURCE: Mus musculus (house mouse) 

organism (orgn): Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
sciurognathi ; Muridae; Murinae; Mus 
NUCLEIC ACID COUNT (NA) : 705 a 1298 c 1094 g 747 t 
COMMENT: 

Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 

cDNA Library Preparation: M. Bento Soares, University of Iowa 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

info@bcgsc.bc.ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
sen Lee, victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi , Jacqueline 
schein, Duane Smailus, Michael smith, Lorraine spence, Jeff stott, 
Michael Thorne, Miranada Tsai , Natasja van den Bosch, Jill vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 

clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 126 Row: b Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA qi : 23346570. 
REFERENCE: 1 (bases 1 to 3844) 

AUTHOR (AU): Strausberg,R.L. ; Feingold,E.A. ; Grouse, L.H.; 

Derge,J.G.; Klausner ,R.D. ; Collins, F.S. ; Wagner, L.; 

Shenmen,C.M. ; schuler ,G.D. ; Al tschul ,S . F. ; Zeeberg.B.; 

Buetow,K.H. ; Schaefer,c.F. ; Bhat.N.K.; Hopkins, R.F. ; 

Jordan, H.; Moore, T.; Max,S.I.; Wang, J.; Hsieh,F.; 

Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 

Hong,L.; stapleton.M. ; soares, M.B.; Bonal do ,M . F. ; 

Casavant,T.L. ; Scheetz,T.E. ; Brownstei n , M . J . ; 

Usdin.T.B.; Toshiyuki ,s. ; Carninci,P.; Prange,C; 

Raha,S.s.; Loquellano,N.A. ; Peters, G. J.; Abramson , r.d. ; 

Mullahy.S.J. ; Bosak,s.A.; McEwan,P.J.; McKernan.K. J . ; 

Malek,J.A.; Gunaratne , P.H . ; Richards, s.; Worley,K.c.; 

Hale,s.; Garcia, A. M. ; Gay,L.J.; Hulyk.S.W.; 

Villalon.D.K. ; Muzny,D.M. ; Sodergren , E . J . ; Lu,x.; 

Gibbs,R.A.; Fahey,j.; Helton, E. ; Ketteman.M.; Madan,A.; 

Rodrigues.s. ; Sanchez, A.; Whiting, M.; Madan,A.; 

Young.A.O; Shevchenko, Y. ; Bouffard,G.G. ; 

Blakesley,R.w. ; Touchman, J .W. ; Green, E.D.; 

Dickson, M. C. ; Rodriguez, A. C. ; Grimwood,J.; Schmutz,j.; 

Myers, R.M.; Butterfield, Y. s. ; Krzywinski ,M. I. ; 

Skalska,U.; Smail us , d. E. ; Schnerch.A. ; Schein,J.E.; 



TITLE (TI): 

JOURNAL (SO): 

OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 



Jones,S.J.; Marra,M.A. 

Generation and initial analysis of more than 15,000 

full-length human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 

2 (bases 1 to 3844) 

strausberg.R. 

Direct Submission 

Submitted (08-SEP-2003) National Institutes of Health, 
Mammalian Gene Collection (MGC) , Cancer Genomics 
office, National cancer institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



1. .3844 



gene 
CDS 



1. .3844 
252. .2780 



misc-feature 714. .926 

misc-feature 984. .1130 

misc-feature 1566. .1877 

misc-feature 2496. .2684 



/orqanism="Mus musculus" 

/moT-type= n mRNA" 

/strain="c57BL/6" 

/db-xref="taxon : 10090" 

/cl one="MGC : 66671 IMAGE : 6813463 " 

/tissue-type="Brain, mouse, 13.5, 

14.5, 16.5, 17.5 dpc" 

/cl one-1 i b="NlH-BMAP-FW0" 

/lab-host="DHl0B" 

/note="Vector: pYX-ASC" 

/gene="Unc5a" 

/db-xref="LocusID: 107448" 

/db-xref= M MGl:894682" 

/codon-start=l 

/product="unc5a protein" 

/protei n-i d="AAH58084 . 1" 

/db-xref="GI: 34784159" 

/db-xref="LocusID: 107448" 

/trans 1 ation="MAVRPGLWPALLGIVLTAWL 

RGSGAQQS ATVAN PVPGAN PD L L P 

HFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCN 

GEWVRQVDHVIERSTDGSSGLPTM 

E V R I N VS RQQ VE K V FG L E E YWCQCVAWS S S GTT K 

SQKAYIRIAYLRKNFEQEPLAKEV 

SLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSL 

DPNVYITREHS LWRQARLADTPN 

YTCVAKNIVARRRSASAAVIVYVDGSWSPWSKWS 

ACGLDCTHWRSRECSDPAPRNGGE 

ECRGADLDTRNCTSDLCLHTSSGPEDVALYIGLV 

AVAVCLILLLLVLVLIYCRKKEGL 

DS DVADS S I LTS G FQ PVS I K PS KADN PH L LTIQ P 

D LSTTTTTYQGS LCPRQDGPS PK F 

QLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRL 

STQNYFRSLPRGTSNMAYGTFNFL 

GGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHK 

PEDVRLPLAGCQTLLSPIVSCGPP 

GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEG 

SWEDVLHLGEESPSHLYYCQLEAG 

ACYVFTEQLGRFALVGEALSVAATKRLRLLLFAP 

VACTS L E YN I R V YC L H DTH D A L K E 

WQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHD 

V PS S LWKS K L L VS YQE I P F YH I WN 

GTQQYLH CT FT L E R VN ASTS D L ACK VWVWQVEGD 

GQS FN I N FN ITKDTR FAEM LAL ES 

EGGVPALVGPSAFKIPFLIRQKIITSLDPPCSRG 

ADWRTLAQKLHLDSHLSFFASKPS 

PTAMI LN LWEARH FPNGN LGQLAAAVAGLGQPDA 

GLFTVSEAEC" 

/note="lG; Region: Immunoglobul in" 
/db-xref="CDD:smart00409" 
/note="TSPl; Region: 
Thrombospondin type 1 repeats" 
/db-xref="CDD:smart00209" 
/note="ZU5; Region: Domain present 
in ZO-1 and Unc5-like netrin 
receptors" 

/db-xref="CDD:smart00218" 
/note=" DEATH; Region: DEATH 
domain, found in proteins involved 



in cell death (apoptosis). 
Alpha-helical domain present in a 
variety of proteins with apoptotic 
functions. Some (but not all) of 
these domains form homotypic and 
heterotypic dimers" 
/db-xref="CDD:smart00005" 

SEQUENCE (SEQ): 

1 gggctccgcg cggcggctgg ctcggtgcgc ggcgcggctg gcgctgcgct ccgccccggc 
61 tgcattgctg cgctccgggt gcccagggga gccacgcgcc gcgtgcgccc cgcagccggc 
121 cgcccggagg cagcgctgtc ctctggcatg ggccccgggg gcgccccgag gtggggctct 
181 cggctgaggc gctgacagcc tctctcccgc ccgcggggcc cctagtccag cccgctcgtc 
241 cgcccgcggc catggccgtc cggcccggcc tgtggccagc gctcctgggc atagtcctca 
301 ctgcctggct tcgtggttcg ggtgcccagc agagtgccac agtggccaac ccagtgcctg 
361 gtgccaaccc ggacctgctg ccccacttcc tggtagagcc ggaggacgtg tacattgtca 
421 agaacaagcc cgtgctgctg gtgtgcaagg ctgtgcccgc cacccagatc ttcttcaagt 
481 gcaacgggga atgggttcgc caggtcgatc acgtcattga acgcagcact gacggcagca 
541 gcggattgcc aaccatggag gtccggatca acgtatcaag gcagcaggtc gagaaagtgt 
601 ttgggctgga ggagtactgg tgccagtgtg tggcatggag ctcctcagga accaccaaaa 
661 gccagaaggc ctacatccgg attgcctatt tgcgcaagaa ctttgagcag gagccgctgg 
721 ccaaggaagt gtcactggag caaggcattg tgctaccttg tcgccccccg gaaggaatcc 
781 ccccagctga ggtggagtgg ctccgaaatg aggacctcgt ggacccctcc ctcgacccca 
841 atgtgtacat cacacgggag cacagcctag tcgtgcggca ggcccgcctg gccgacactc 
901 ccaactacac ctgcgtggcc aagaacatcg tggcccgtcg ccgaagcgcc tctgcggccg 
961 tcattgttta tgtggatggg agctggagcc catggagtaa gtggtcagcc tgcgggcttg 
1021 actgcaccca ctggcggagc cgggagtgct ccgacccagc gccccgcaac ggaggtgagg 
1081 agtgccgggg tgctgacctg gacacccgca actgtaccag tgacctctgc ctgcacacct 
1141 cttccggccc cgaggacgtg gctctctaca tcggcctcgt cgccgtggcc gtgtgcctca 
1201 tcttgctgct gctggtcctc gtcctcatct actgccgcaa gaaggaagga ctggactcag 
1261 acgtggctga ctcatccatc cttacctcag gcttccagcc tgtcagcatc aagcccagca 
1321 aagcagacaa tccccatctg ctcaccatcc aaccggacct cagcaccacc acgaccacct 
1381 accagggcag cctgtgtccc cggcaggatg gacccagccc caagttccag ctctctaatg 
1441 gtcacctgct cagcccactg ggcagtggcc gccatacgct gcaccacagc tcccccacct 
1501 ctgaggctga ggacttcgtc tcccgcctct ccacccaaaa ctactttcgt tctctgcccc 
1561 gcggtaccag caacatggcc tatgggacct tcaacttcct cgggggccgg ctgatgatcc 
1621 ctaacacagg aatcagcctc ctcatacccc cggacgccat cccccgagga aagatctacg 
1681 agatctacct cactctgcac aagccagaag acgtgaggtt gcccctagct ggctgtcaga 
1741 ccctgctgag tcctatcgtt agctgtgggc ccccaggagt cctgctcacc cggccagtca 
1801 tccttgccat ggaccactgc ggggagccca gtcccgacag ctggagcctg cgcctcaaaa 
1861 agcagtcctg tgagggcagc tgggaggacg tgctgcacct tggtgaggag tcgccctctc 
atctctacta ctgccagctg gaggccgggg cctgctatgt ctttaccgag cagctaggcc 
1981 gctttgccct ggtgggagag gccctcagcg tggctgccac caagcgcctc aggctccttc 
2041 tgtttgcccc tgtggcctgt acgtccctcg agtacaacat ccgagtgtac tgcctgcacg 
2101 acacccacga tgctctcaag gaggtggtgc agctggagaa gcagctgggt ggacagctga 
2161 tccaggagcc ccgtgtcctg cacttcaaag acagttacca caacctacgt ctgtccatcc 
2221 acgacgtgcc cagctccctg tggaagagca agctccttgt cagctaccag gagatccctt 
2281 tttaccacat ctggaatggc actcagcagt atctgcactg caccttcacc ctggagcgcg 
lilt} tcaat g cca g caccagcgac ctggcctgca aggtgtgggt gtggcaggtg gagggagatg 
2401 gacagagctt caacatcaac tttaacatca ctaaggacac gaggtttgct gaaatgctgg 
2461 ctctggagag tgaagggggg gtcccagccc tggtgggccc cagtgccttc aagatcccct 
^ro3" tcctcattc g gcaaaagatc attaccagcc tggacccacc ctgcagccgg ggcgccgact 
2581 ggcgaactct agcccagaaa cttcacctgg acagccatct tagcttcttt gcctccaagc 
ccagccctac agccatgatc ctcaacctat gggaggcgcg gcacttcccc aacggcaacc 
¥-tI} tc 99c<=agct ggccgcagct gtggccggac tgggccagcc agatgctggc ctcttcaccg 
tfftcagaggc cgagtgctga gaccagccag gccggtcacg cctacactct caccagcttt 
2821 ggcacctgcc agggacaggc aaaagccaga caggggccct acccccacac ccggggagag 
™a} ct g ctt gg ac aggcccccct cctggctgaa gttgtccctc gatgctggtc cttcagaccc 
2941 tgcccaaact ccatccctcc atggcctgcc tggccaggtt ggtttagcca cctgttctcg 
19^7 ctct g ccct g gtcccggggc ccagagtgga cagtgcctgg agcctgggct gagcccagcc 
3061 catctgtgtg tgtgtgtgta tgtgtgtgat gctacctctc ttcctgtccc ttgccagggg 
ccccgcatac acacagcatg cgcacacatg ctgggcttgg gacacggccc ccagagctcc 
3181 tgcctgaggt gggccttatg caaacatttc tgtgcctgct gggtaggggt ctatttgagg 
3241 ggcctggctt caagcctggg gggactaagg gtcccagctg gacaggggct ggcccttgga 
3301 ttcaggcaca cgatcaccac acaggcgtgt gttcatgcat gcctcgtgtg ctcatctcac 
3361 acgcacccct ctcccaggtc atgcaggacc cctcccccca ccacacacac atctcatgct 
3421 gtgcacccgg aggctgctca cgtctctcac acccggtgtc ggtgtcggta cacatctgcc 
Ica} tctcacat g c tgcccttctc ccacccaccc agggacaccc gacggctcct ccctgatcct 
IIt} ttccctgtac cccggcctcg aggtgccctg cccagcgggg cgtgtgaata tgcaatggga 
3o01 gtcccgggct gtacaatggc aagtgtgtgt gccgtggcgt gcccgttcct ggggctggcc 
3661 aatgcccctg tgtggggcct gttgtgtgaa gcttgtgtcc tgactctgtc ttaagtgcat 
III} tcct g cact t acacttggcc ttatgtacac agccttgccc ggctgccggg gcacgtaggg 
3781 attttagcgg atgtgaatgt aaataaatta tatatatata ttgctaaaaa aaaaaaaaaa 
3841 aaaa 
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LOCUS (LOC): 

GenBank ACC. no. (gbn) 
GenBank version (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 

reference: 

AUTHOR (AU) : 
TITLE (TI): 

JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO): 



AB114633 GenBank (r) 
AB114633 

AB114633.1 GI: 33438235 

562037-10-9 

3314 

mRNA; linear 
Other vertebrates 
5 Aug 2003 

Danio rerio SH3BP4 mRNA for src homology 3 binding 
protein 4, complete cds. 
Danio rerio (zebrafish) 
Danio rerio 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Eutel eostomi ; Actinopterygii ; Neopterygii; Teleostei; 
ostariophysi ; Cypri ni formes ; Cyprinidae; Danio 
: 959 a 746 c 759 g 850 t 
1 

Abe.s.; Doi ,M . ; Nakagawa.T. 

Structural and phylogenetic analyses of the SH3BP4 

cDNAs in fish and human 

Unpublished 

2 

Abe,S.; Nakagawa,T.; Wang, P. 

Danio rerio cDNA for SH3BP4 long form, complete CDS 
Published only in Database (2003) 
3 (bases 1 to 3314) 
Abe,S. 

Direct Submission 

submitted (14-JUL-2003) shunnosuke Abe, Ehime 
university, Laboratory of Molecular Cell Biology, 
Department of Bioresources , Faculty of Agriculture; 
3-5-7 Tarumi, Matsuyama city, Ehime Prefecture 7908566, 
Japan (E-mail :abe@mcb. agr.ehime-u.ac.jp, 
URL : http : //web-mcb . agr . ehi me-u .ac.jp/, 
Tel :81-89-946-9853, Fax:81-89-977-4364) 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



+ 1..3314 



1. .3314 
146. .2956 



source 



gene 
CDS 



/orqanism= M Danio rerio" 

/moT-type="mRNA" 

/db-xref="taxon:7955" 

/sex="mixed" 

/dev-stage=" Juvenile" 

/gene="SH3BP4" 

/gene="SH3BP4" 

/codon-start=l 

/product="src homology 3 binding 
protein 4" 

/protei n-i d="BAC81503 . 1" 

/db-xref="Gl: 33438236" 

/translation="MAAHRIRVTNNNNVLPRCKS 

EGTLIDIDEGVTEASLVDVKVPSP 

SALRLAPSTSFGTAREWAIKDYCPSSFTTLKFS 

KGDHLYVLDTSGGEWWYAHNNREM 

GYIPSTYVQPINYRNSSLSDSGMIDNLGDCSEEG 

AKE LDL LGEWTGVCLK PS L PYDNN 

NPFSMLSSTNPFLNGSMDDVLDQKGTALVASMGH 

PPALDLEALADASQGTIPFFGSKR 

SYSLSELSTLQAQSNPPLPSSGFFTGLKAPSPEQ 

FQNREDFRTAWLNHRKLARSCHDL 

ES LGQN PGWGQTQ PVETN I VCK LDS YGGAVQ L PD 

TNISIHVPEGHVAVGDTQQISMKA 

LLDPPLELNNDRCSTVS PWEIKLSNMEIKSFIT 

LEMKVSVMTRTES PMAEIVCVRSD 

CKEGPYSPVPQAYIYDDTIQVQLDNLEPCMYVAV 

WQAKHVSFSSNVWNHMVKKVTLG 

LYGPKHIH PS FKTWAI FGHDCAPKTLLVS EVGK 

QAKSAPPVSLQLWGKHQFVLSRPQ 

DLQIGMYSNMSNYEVKANDQARWRGFQIKLGKV 

SRLIYMISSRKADEISDFTLRVQV 

KDDQDCILAQFCVQTPQPPPKTGMRNNGQRRFLK 

KKEVDKIILSPLAITTKYPQFQER 

CITNLKFGKLIKTVIRQTKNQYLLEYKKGDIIAL 

LSEEKIKLKGQLWTKEWYIGYYQG 



misc-feature 320. .484 



misc-feature 1042. .1330 



misc-feature 2441. .2737 



polyA-signal 3285. .3290 
polyA-site 3302 



KIGLVHVKNVLVLGKVKPIYFCGPDLTTTMLMEQ 

I LKPCKFLTYI YASVRTI LMEN LG 

NWRAFADALSYVNLPLTYFCRAELDSEPERVASV 

LEKLKEDCLNMETKEKKSFQKELM 

M A L L K I DCQG L VAR L I M D FV L LTTAVE VA P RWR D 

L A E K L AH VS KQQM E A Y E A PH R D KT 

GMVDS EAMWK PA YD F L LTWAAQIGDSYRDVI H E L 

HMGLDRMKNPITKRWKHLTGTLIL 

VNCLDLLRSSAFSPAPQDDFAI" 

/gene="SH3BP4" 

/note="Region encoding SH3 (Src 
homology 3) domain (pfam00018) n 
/gene= SH3BP4" 

/note= M Region encoding ZU5, ZU5 
domain. Domain present in zo-1 and 
Unc5-like netrin receptors Domain 
(pfaqm00791) M 
/gene= M SH3BP4" 

/note= n Region encoding PIP5K 

domain (partial), pfam01504 n 

/gene="SH3BP4" 

/gene= M SH3BP4" 

/evi dence=exper i mental 



SEQUENCE (SEQ): 

1 aaacaaagtg 
61 caggaatgca 
121 ggtgaaatag 
181 caatgtgctc 
241 tgaggcgagt 
301 gacttcattt 
361 taccacctta 
421 atggtggtac 
481 aatcaactac 
541 ttcggaggag 
601 accctccctg 
661 tctaaatggg 
721 gggacatcca 
781 ttttttcggg 
841 aaatcctcca 
901 gtttcagaac 
961 ttgccatgat 
1021 aaccaatatt 
1081 tatcagtatt 
1141 gaaggctcta 
1201 agtggttgaa 
1261 ggtatcagtt 
1321 ttgcaaagag 
1381 agttcagctg 
1441 tgtttccttc 
1501 tgggcccaaa 
1561 tgcccctaag 
1621 ttccctacag 
1681 aggcatgtat 
1741 tcgtgggttt 
1801 gaaagcagat 
1861 tatccttgcg 
1921 caatgggcaa 
1981 agcaatcacc 
2041 caaactgatc 
2101 aggggatatc 
2161 caaagaatgg 
2221 gcttgtgttg 
2281 gctgatggag 
2341 gacaatactt 
2401 gaatcttcct 
2461 ctctgtgctg 
2521 ttttcagaag 
2581 gctcatcatg 
2641 tctcgccgaa 
2701 cagagataaa 
2761 gttgacgtgg 
2821 gggcctggac 
2881 tatcctcgtc 
2941 tgactttgcc 
3001 gaccgtgctg 



aagtgagcaa 

gaggacaaaa 

caaaaccgat 

cctaggtgta 

ctcgtagatg 

gggactgcac 

aagttctcca 

gcccacaaca 

aggaactcat 

ggcgcaaaag 

ccttatgaca 

tctatggatg 

ccagcgcttg 

agcaagagat 

ttgccttcat 

agagaggact 

cttgaatctc 

gtctgcaagt 

catgttcccg 

ctagatcctc 

atcaaactga 

atgactagga 

ggtccttatt 

gacaacttgg 

agttcaaatg 

catattcacc 

actctactag 

ctatggggga 

tcaaacatgt 

cagatcaaac 

gagatttctg 

caattttgtg 

aggcggtttc 

accaaatacc 

aaaaccgtta 

atcgcccttc 

tacattggat 

ggaaaggtta 

caaatcctaa 

atggaaaatc 

ttgacgtatt 

gagaaactaa 

gaactaatga 

gactttgtgc 

aaactggccc 

accggcatgg 

gctgctcaga 

cgaatgaaga 

aactgcctcg 

atttaatctc 

aacttttatt 



ctcaacacta 

tggaaatctg 

gcaagatggc 

aatcagaggg 

tcaaagtgcc 

gggaagttgt 

aaggggatca 

accgagaaat 

ctcttagtga 

agctagacct 

acaacaaccc 

atgtcctcga 

atctcgaagc 

cctacagttt 

cagggttttt 

tcaggacagc 

ttgggcagaa 

tggacagcta 

aaggccatgt 

ctttggaact 

gcaacatgga 

cggaaagtcc 

ctcctgttcc 

agccctgcat 

tatggaatca 

cgtcttttaa 

taagtgaggt 

agcaccaatt 

caaactatga 

taggcaaggt 

acttcacgct 

ttcaaacacc 

ttaagaagaa 

cacagtttca 

tcagacagac 

ttagtgagga 

actatcaggg 

agcctatata 

agccctgcaa 

tgggaaactg 

tttgcagagc 

aggaagattg 

tggctcttct 

tgctgacgac 

acgtatccaa 

tggacagtga 

tcggcgacag 

atcccatcac 

acttgcttcg 

ccaccccctc 

ggatattcac 



caactgcggg 

ccagtgtttg 

tgcccacagg 

cactcttatt 

ttctcccagt 

tgccattaag 

tctctatgtg 

gggatacatt 

cagtggaatg 

gctaggggag 

tttctccatg 

tcaaaagggt 

cttggcagat 

atcggagtta 

tacaggactt 

atggctaaac 

cccaggctgg 

tggaggggca 

tgcagtagga 

gaacaatgac 

gataaagtca 

aatggcggag 

acaagcttac 

gtatgtcgca 

tatggtgaag 

aactgttgtg 

tgggaagcaa 

tgtacttagc 

ggtcaaggca 

gagcaggctc 

acgtgtccaa 

ccagccgccc 

ggaagttgat 

agaacgctgc 

caagaatcag 

gaagatcaaa 

aaagattgga 

cttctgtggg 

gtttctcact 

gcgagcattt 

agagctggat 

cctcaacatg 

gaaaatcgac 

ggctgttgaa 

gcaacaaatg 

ggccatgtgg 

ttaccgcgac 

caagcgctgg 

aagctccgcc 

ctaatatttg 

ttttctacgt 



gctgctttgg 

attatcaggt 

atcagagtaa 

gatattgatg 

gccttgcgtt 

gattactgtc 

ctggacacat 

ccttcaacat 

attgacaatc 

tggactggag 

ctgtcctcaa 

acagcattag 

gcttcgcaag 

tcaaccttac 

aaagctccct 

caccgaaagt 

ggtcaaacac 

gtacagctcc 

gacactcagc 

agatgctcca 

ttcattacct 

attgtatgtg 

atttatgatg 

gttgttgtcc 

aaagttacat 

gccatttttg 

gcaaaatctg 

agacctcagg 

aatgatcaag 

atctacatga 

gtaaaagatg 

cctaaaacag 

aaaataattt 

attaccaact 

tacctgctgg 

ctcaaaggac 

cttgtgcatg 

ccagacctta 

tacatttatg 

gctgatgctc 

agtgaacccg 

gagaccaagg 

tgtcagggtt 

gtggctcctc 

gaggcctatg 

aaaccggctt 

gtcatccatg 

aaacacctca 

tttagccccg 

cttgtatgtc 

caccgtcagc 



ataaacatat 

ttggaggaat 

ccaacaacaa 

agggagtgac 

tggccccatc 

catccagttt 

ctggagggga 

atgtccagcc 

ttggggactg 

tgtgtctgaa 

ccaacccatt 

tggcttcgat 

ggacaatccc 

aagctcagtc 

cgccagaaca 

tggcacgatc 

aaccagtaga 

cagatacgaa 

aaatatctat 

ctgtcagtcc 

tagagatgaa 

tgagaagtga 

ataccataca 

aggctaaaca 

tgggcctcta 

gccatgactg 

caccccctgt 

acttacagat 

ctagggttgt 

tttcatcacg 

atcaggattg 

ggatgcggaa 

tgtctccttt 

tgaagtttgg 

aatataaaaa 

aactatggac 

taaaaaatgt 

caaccacaat 

cctctgtgag 

tgagctacgt 

agagagttgc 

agaagaagtc 

tggtggcgcg 

gctggaggga 

aggctccaca 

acgacttcct 

aactgcacat 

ccggcaccct 

cccctcagga 

acctcacgaa 

tgtgagttta 



3061 cctcaatcca ttagccacaa ggacttcctg tacccgccca cgtccatata agggcatgcg 

3121 gaaagccaat aaagccaaaa cacagagtgt tttacgtgta gctatttttc gagatttccg 

3181 aaggtgtttt tatccatttt tgttctttgc gatatctata ttttttatat tggatggcac 

3241 aaaataaatt tttaccattt ttttcaataa caattttttc agataataaa agaaatctga 
3301 ataaaaaaaa aaaa 
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LOCUS (LOC) : 

GenBank ACC. NO. (GBN) 

GenBank VERSION (VER) : 

CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) I 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 

AB104885 GenBank (R) 
AB104885 

AB104885.1 Gl: 33438221 

562036-96-8 

2808 

mRNA; linear 
Other vertebrates 
5 Aug 2003 

Danio rerio SH3BP4 mRNA for truncated SH3 binding 
domain protein 4, complete cds. 
Danio rerio (zebrafish) 
Danio rerio 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Actinopterygii ; Neopterygii; Teleostei; 
Ostariophysi ; Cypri ni formes ; Cyprinidae; Danio 
:^827 a 629 c 613 g 739 t 

Abe,s. ; Nakagawa,T. 

Danio rerio mRNA for tr-SH3BP4 (truncated SH3 binding 

protein 4) short form 

Published Only in Database (2003) 

2 (bases 1 to 2808) 

Abe,S. 

Direct Submission 

Submitted (04-MAR-2003) Shunnosuke Abe, Ehime 
University, Laboratory of Molecular Cell Biology, 
Department of Bioresources , Faculty of Agriculture; 
3-5-7 Tarumi, Matsuyama city, Ehime Prefecture 7908566, 
Japan (E-mail :abe@mcb. agr.ehime-u.ac.jp, 
URL : http : //web-mcb . agr . ehi me-u .ac.jp/, 
Tel :81-89-946-9853, Fax:81-89-977-4364) 



FEATURES (FEAT) 
Feature Key 



Location 



Qualifier 



source 



gene 
CDS 



1. .2808 



1. .2808 
146. .2419 



/organism="Danio rerio" 

/moT-type^mRNA" 

/db-xref="taxon:7955" 

/dev-stage= n Adult" 

/gene="SH3BP4" 

/gene="SH3BP4" 

/note="This variant of SH3BP4 in 
Danio rerio lacks Death 
domain(pfam00531) , which 
identified in SH3BP4 from five-ray 
yellowtail (AB086184) and commonly 
found in those from other species 
tr-SH3BP4" 
/codon-start=l 

/product= n truncated SH3 binding 

domain protein 4" 

/protei n-i d="BAC81502 . 1" 

/db-xref="Gi: 33438222" 

/transl ation="MAAHRIRVTNNNNVLPRCKS 

EGTLIDIDEGVTEAS LVDVKVPS P 

SALRLAPSTSFGTAREWAIKDYCPSSFTTLKFS 

KGDHLYVLDTSGGEWWYAHNNREM 

GYIPSTYVQPINYRNSSLSDSGMIDNLGDCSEEG 

AKELDLLGEWTGVCLKPSLPYDNN 

NPFSMLSSTNPFLNGSMDDVLDQKGTALVASMGH 

PPALDLEALADASQGTIPFFGSKR 

SYSLSELSTLQAQSNPPLPSSGFFTGLKAPSPEQ 

FQNREDFRTAWLNHRKLARSCHDL 

ESLGQNPGWGQTQPVETNIVCKLDSYGGAVQLPD 

TNISIHVPEGHVAVGDTQQISMKA 

LLDPPLELNNDRCSTVS PWEIKLSNMEIKSFIT 

LEMKVSVMTRTESPMAEIVCVRSD 

CKEGPYSPVPQAYIYDDTIQVQLDNLEPCMYVAV 



misc-feature 320. .484 



misc-feature 1043. .1330 



misc-feature 1694. .2077 



polyA-signal 2779. .2784 

SEQUENCE (SEQ): 

1 aaacaaagtg aagtgagcaa ctcaacacta 

61 caggaatgca gaggacaaaa tggaaatctg 

121 ggtgaaatag caaaaccgat gcaagatggc 

181 caatgtgctc cctaggtgta aatcagaggg 

241 tgaggcgagt ctcgtagatg tcaaagtgcc 

301 gacttcattt gggactgcac gggaagttgt 

361 taccacctta aagttctcca aaggggatca 

421 atggtggtac gcccacaaca accgagaaat 

481 aatcaactac aggaactcat ctcttagtga 

541 ttcggaggag ggcgcaaaag agctagacct 

601 accctccctg ccttatgaca acaacaaccc 

661 tctaaatggg tctatggatg atgtcctcga 

721 gggacatcca ccagcgcttg atctcgaagc 

781 ttttttcggg agcaagagat cctacagttt 

841 aaatcctcca ttgccttcat cagggttttt 

901 gtttcagaac agagaggact tcaggacagc 

961 ttgccatgat cttgaatctc ttgggcagaa 

1021 aaccaatatt gtctgcaagt tggacagcta 

1081 tatcagtatt catgttcccg aaggccatgt 

1141 gaaggctcta ctagatcctc ctttggaact 

1201 agtggttgaa atcaaactga gcaacatgga 

1261 ggtatcagtt atgactagga cggaaagtcc 

1321 ttgcaaagag ggtccttatt ctcctgttcc 

1381 agttcagctg gacaacttgg agccctgcat 

1441 tgtttccttc agttcaaatg tatggaatca 

1501 tgggcccaaa catattcacc cgtcttttaa 

1561 tgcccctaag actctactag taagtgaggt 

1621 ttccctacag ctatggggga agcaccaatt 

1681 aggcatgtat tcaaacatgt caaactatga 

1741 tcgtgggttt cagatcaaac taggcaaggt 

1801 gaaagcagat gagatttctg acttcacgct 

1861 tatccttgcg caattttgtg ttcaaacacc 

1921 caatgggcaa aggcggtttc ttaagaagaa 

1981 agcaatcacc accaaatacc cacagtttca 

2041 caaactgatc aaaaccgtta tcagacagac 

2101 aggggatatc atcgcccttc ttagtgagga 

2161 caaagaatgg tacattggat actatcaggg 

2221 gcttgtgttg ggaaaggtta agcctatata 

2281 gctgatggag caaatcctaa agccctgcaa 

2341 gacaatactt atggaaacac ctcaccggca 

2401 ttcgaagctc cgcctttagc cccgcccctc 

2461 cctcctaata tttgcttgta tgtcacctca 

2521 tcacttttct acgtcaccgt cagctgtgag 



WQAKHVSFSSNVWNHMVKKVTLG 

LYGPKHIHPSFKTWAIFGHDCAPKTLLVS EVGK 

QAKSAPPVSLQLWGKHQFVLSRPQ 

DLQIGMYSNMSNYEVKANDQARWRGFQIKLGKV 

SRLIYMISSRKADEISDFTLRVQV 

KDDQDCILAQFCVQTPQPPPKTGMRNNGQRRFLK 

KKEVDKIILSPLAITTKYPQFQER 

CITNLKFGKLIKTVIRQTKNQYLLEYKKGDIIAL 

LSEEKIKLKGQLWTKEWYIGYYQG 

KIGLVHVKNVLVLGKVKPIYFCGPDLTTTMLMEQ 

I LKPCKFLTYI YASVRTI LMETPH 

RHPYPRQLPRLASKLRL" 

/gene="SH3BP4 M 

/product= M truncated SH3 binding 
domain protein 4 (tr-SH3BP4)" 
/note= ,! Region encoding SH3 (Src 
homology 3) domains (pfam00018)" 
/gene= SH3BP4" 

/product="truncated SH3 binding 
domain protein 4 (tr-SH3BP4) M 
/note="Region encoding ZU5, ZU5 
domain. Domain present in ZO-1 and 
Unc5-like netrin receptors Domain 
(pfaqm00791) " 
/gene= n SH3BP4" 

/product="truncated SH3 binding 
domain protein 4 (tr-SH3BP4) n 
/note="Region encoding 
Transposase-2 , Probable 
transposase (pfam01385) M 
/gene="SH3BP4" 



caactgcggg gctgctttgg ataaacatat 
ccagtgtttg attatcaggt ttggaggaat 
tgcccacagg atcagagtaa ccaacaacaa 
cactcttatt gatattgatg agggagtgac 
ttctcccagt gccttgcgtt tggccccatc 
tgccattaag gattactgtc catccagttt 
tctctatgtg ctggacacat ctggagggga 
gggatacatt ccttcaacat atgtccagcc 
cagtggaatg attgacaatc ttggggactg 
gctaggggag tggactggag tgtgtctgaa 
tttctccatg ctgtcctcaa ccaacccatt 
tcaaaagggt acagcattag tggcttcgat 
cttggcagat gcttcgcaag ggacaatccc 
atcggagtta tcaaccttac aagctcagtc 
tacaggactt aaagctccct cgccagaaca 
atggctaaac caccgaaagt tggcacgatc 
cccaggctgg ggtcaaacac aaccagtaga 
tggaggggca gtacagctcc cagatacgaa 
tgcagtagga gacactcagc aaatatctat 
gaacaatgac agatgctcca ctgtcagtcc 
gataaagtca ttcattacct tagagatgaa 
aatggcggag attgtatgtg tgagaagtga 
acaagcttac atttatgatg ataccataca 
gtatgtcgca gttgttgtcc aggctaaaca 
tatggtgaag aaagttacat tgggcctcta 
aactgttgtg gccatttttg gccatgactg 
tgggaagcaa gcaaaatctg caccccctgt 
tgtacttagc agacctcagg acttacagat 
ggtcaaggca aatgatcaag ctagggttgt 
gagcaggctc atctacatga tttcatcacg 
acgtgtccaa gtaaaagatg atcaggattg 
ccagccgccc cctaaaacag ggatgcggaa 
ggaagttgat aaaataattt tgtctccttt 
agaacgctgc attaccaact tgaagtttgg 
caagaatcag tacctgctgg aatataaaaa 
gaagatcaaa ctcaaaggac aactatggac 
aaagattgga cttgtgcatg taaaaaatgt 
cttctgtggg ccagacctta caaccacaat 
gtttctcact tacatttatg cctctgtgag 
cccttatcct cgtcaactgc ctcgacttgc 
aggatgactt tgccatttaa tctcccaccc 
cgaagaccgt gctgaacttt tattggatat 
tttacctcaa tccattagcc acaaggactt 



2581 cctgtacccg cccacgtcca tataagggca tgcggaaagc caataaagcc aaaacacaga 
2641 gtgttttacg tgtagctatt tttcgagatt tccgaaggtg tttttatcca tttttgttct 
2701 ttgcgatatc tatatttttt atattggatg gcacaaaata aatttttacc atttttttca 
2761 ataacaattt tttcagataa taaaagaaat ctgaataaaa aaaaaaaa 



L2 ANSWER 91 OF 104 



GENBANK . RTM . COPYRIGHT 2004 on STN 



LOCUS (loc): 
GenBank ACC. 



NO. 



GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



AY187310 
(GBN): AY187310 



GenBank (R) 



AY187310.1 Gl: 31442350 
528194-13-0 
2962 

mRNA; linear 
other vertebrates 
6 Jun 2003 

Gall us gall us ***UNC5*** 
complete cds. 
Gall us gall us (chicken) 
Gall us gall us 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Archosauria; Aves; Neognathae; 
Gall i formes; Phasianidae; Phasianinae; Gall us 
NUCLEIC ACID COUNT (NA) : 746 a 783 c 803 g 630 t 



-like protein 3 mRNA, 



SOURCE: 
ORGANISM (ORGN) : 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 



1 (bases 1 to 2962) 
Guan,W.; Condic,M.L. 

Characterization of Netrin-1, Neogenin and CUNC-5H3 
expression during chick dorsal root ganglia development 
Gene Expr. Patterns, 3, 369-373 (2003) 
CA 139:320285 

2 (bases 1 to 2962) 
Guan,W.; Condi c,M.L. 
Direct submission 

Submitted (26-NOV-2002) Neurobiology & Anatomy, 
university of Utah, 20 North, 1900 East, Salt Lake 
City, UT 84132-3401, USA 



FEATURES (FEAT): 

Feature Key Location 

source 1..2962 



CDS 



1..2796 



Qualifier 

/orqanism="Gallus gall us" 
/moT-type= ,, mRNA ,, 
/db-xref="taxon:9031" 
/codon-start=l 

/product= M UNC5-like protein 3" 

/protein-id="AA067275.1" 

/db-xref="GI : 31442351" 

/translation="MGKGLEGTAARCGLGMGYLL 

H S WL PA LAVLGAS R PGS AAQDDD 

FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKP 

VNLYCKAS PATQIYFKCNSEWVHQ 

KDHWDERVDETSGLIVCEVSIEISRQQVEELFG 

PEDYWCQCVAWS SAGTTKS RKAYV 

RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGI 

PVAEVEWLKNEEVIDPVEDRNFYI 

TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTT 

ATVIVYVNGGWSTWTEWSACNSRC 

GRGFQKRTRTCTNPAPLNGGAFCEGQNVQKIACT 

T LC PVDGKWTS WS KWSTCGTECTH 

WRRRECTAPAPKNGGKDCEGLVLQSKNCTDGLCM 

QAAPDSDDVALYVGIVIAVIVCLA 

ISVWALFVYRKNHRDFESDIIDSSALNGGFQPV 

NIKAARQDLLAVPPDLTSAAAMYR 

GPVYALHDVSDKIPMTNSPILDPLPNLKIKVYNT 

SGAVTPQDELSDFSSKLSPQITQS 

LLENETLNVKNQSLARQTDPSCTAFGTFNSLGGH 

L VI PN S GVS L L I PAGAV PQGRV YE 

MYVTVHRKEGMRPPVEDSQTLLTPWSCGPPGAL 

LTRPWLTMHHCAEPNMDDWQIQL 

KHQAGQGPWEDWWGEEN FTTPCYIQ LDP EACH 

ILTETLSTYALVGQSITKAAAKRL 

KLAIFGPLSCSSLEYSIRVYCLDDTQDALKEVLQ 

L E RQMGGQ L L E E PKT LH FKGSTH N 

LRLSIHDIAHSLWKSKLPAKYQEIPFYHIWSGCQ 

RNLHCTFTLERFSLNTLELVCKLC 

VRQVEGEGQIFQLNCSVSEEPTGIDYPIMDSAGS 

ITTIVGPNAFSIPLPIRQKLCSSL 



DAPQTRGHDWRMLAHKLKLDRYLNYFATKSSPTG 
VILDLWEAQNFPDGNLSMLAAVLE 
EMGRHETWS LAAEGNY" 



SEQUENCE (SEQ): 

1 atggggaagg ggctggaggg cacggcggcc 
61 cacagcgtgg tgctcccggc actggccgtc 
121 caagatgatg atttttttca tgaacttcca 
181 ttgccccact ttctcattga acccgaagaa 
241 ctgtactgca aagcgagccc tgccacgcag 
301 catcagaagg atcatgtggt ggatgagaga 
361 gaggtgagca tcgagatttc ccgccagcag 
421 tggtgccagt gtgtcgcctg gagctcagct 
481 cgcattgcat atctcagaaa gacttttgag 
541 gagcaagagg tcctgctcca gtgccgtcct 
601 tggctgaaga atgaagaggt gatcgatcct 
661 gatcacaacc tgatcatcaa gcaagcccgg 
721 gccaaaaaca ttgtggccaa aaggaaaagc 
781 ggaggctggt ctacctggac cgagtggtca 
841 cagaagcgca caaggacctg cactaaccct 
901 gggcaaaatg ttcagaaaat agcttgcacc 
961 tcctggagca agtggtccac ttgtggcaca 
1021 acagctccgg ccccgaagaa tggaggcaag 
1081 aactgcactg atgggctctg catgcaggct 
1141 gtggggattg tcattgctgt gattgtgtgc 
1201 gtctatcgca agaaccaccg tgactttgag 
1261 gggggatttc agcctgttaa catcaaggct 
1321 gacctcactt ctgctgcagc catgtacagg 
1381 gataaaatcc caatgaccaa ttctccgatc 
1441 gtttataaca cctctggagc agtcaccccc 
1501 ctgtccccac agattaccca gtctctgttg 
1561 agccttgcac ggcaaacaga cccatcctgc 
1621 ggccacctag taattcctaa ttcaggagtg 
1681 caaggaagag tctatgaaat gtatgtgaca 
1741 gtagaagaca gccagacgct gctgacacca 
1801 ctgacccgac ccgttgtgct gaccatgcac 
1861 cagatccagc tcaagcacca ggcaggccag 
1921 gaggaaaact tcaccactcc atgctacatc 
1981 acggagaccc tcagcacgta cgccttggtg 
2041 cgtctcaaat tggccatctt tggaccactg 
2101 gtctactgcc tcgatgacac acaggatgcc 
2161 atgggtgggc agctgttgga ggaacccaaa 
2221 ctgcgcttat ccattcatga cattgcccac 
2281 taccaggaga ttccttttta ccacatctgg 
2341 ttcacgctgg aacgattcag tctcaatacc 
2401 caagtcgaag gagaagggca gatcttccag 
2461 ggcattgatt atcccatcat ggattcagca 
2521 gctttcagca tccccctccc aataaggcag 
2581 acccggggcc atgactggag gatgctggcc 
2641 tattttgcta cgaagtcgag tcccactggg 
2701 ttccctgatg gcaacctgag catgctggca 
2761 accgttgttt ctttggcagc agaaggaaat 
2821 gacagacgca gggagcggta atgccctgtt 
2881 caatgagtta cacaagagac atttcagaga 
2941 acagccctga ccttcacatg tg 
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cgctgcgggc 
ctgggggcca 
gaaacttttc 
gcttacatcg 
atctatttta 
gtagatgaaa 
gtggaagagc 
ggcaccacca 
caggagccgc 
cctgaaggca 
gtggaagacc 
ctttccgaca 
acgacagcaa 
gcgtgcaaca 
gccccactca 
accctgtgtc 
gagtgtaccc 
gactgtgagg 
gcacctgact 
ctggctattt 
tcagatatta 
gcaagacaag 
gggcctgtgt 
ctggacccac 
caggatgaac 
gagaatgaga 
actgcatttg 
agcttgctga 
gtccacagga 
gtggtgagct 
cactgtgctg 
ggaccatggg 
cagctggacc 
ggacaatcca 
tcctgttcct 
ctgaaggagg 
actttgcatt 
tctctctgga 

agtgggtgcc 

ctggagctcg 
ctgaactgct 
ggcagcatca 
aagctctgca 
cacaagctga 
gtgatcctgg 
gcagtgctgg 
tactgatgca 
attgcaagca 
agaaagcaag 



tgggaatggg 
gccggcccgg 
cttctgatcc 
tgaaaaacaa 
agtgcaacag 
cctctggtct 
tctttggacc 
agagccgcaa 
tggggaaaga 
ttccagtagc 
gaaattttta 
cggctaacta 
ctgtgattgt 
gccgctgtgg 
atgggggggc 
cagtggatgg 
actggcgccg 
gactggtgct 
cggatgatgt 
ctgtggttgt 
tcgactcatc 
acctcttggc 
atgccttgca 
tgcccaatct 
tctctgactt 
ctctgaacgt 
ggaccttcaa 
tcccagcagg 
aggagggcat 
gtggcccacc 
agcccaacat 
aggatgtagt 
cagaggcctg 
tcaccaaagc 
cactggagta 
tcctccagct 
ttaagggaag 
agagcaaact 
agaggaactt 
tctgcaaact 
cagtatcaga 
caacgatagt 
gcagcctgga 
aattggacag 
atctctggga 
aagaaatggg 
gccttggcgt 
tttgaaatga 
caaacaaaca 



atacctgctg 
ctccgccgcg 
tccagagcca 
gcctgtgaat 
tgaatgggtt 
gatcgtctgt 
cgaggactac 
ggcctacgtc 
agtgtccctg 
tgaggtagag 
catcaccatt 
cacctgtgtt 
ctatgtgaat 
gagaggcttc 
cttctgcgag 
caaatggacg 
gagggagtgc 
gcagtctaag 
tgctctctac 
ggccctgttt 
ggcgctaaat 
agtgccacca 
tgatgtctct 
gaagattaaa 
ctcctccaag 
gaagaaccaa 
ctcgttaggg 
ggctgttccc 
gagaccacct 
aggagcgctg 
ggatgactgg 

ggtggtcggg 

tcatatcctg 
agcagccaaa 
cagcatccgc 
tgagcggcag 
tacccacaac 
gccggctaaa 
gcactgcacc 
ctgtgtgcgg 
ggaacccact 
tgggcccaac 
tgcaccccag 
gtacctaaat 
agcccagaat 
acgacatgaa 
gaacaggaag 
aaacccagac 
gaaaaaatat 



locus (loc): 
GenBank ACC. NO. (gbn) 
GenBank version (ver) : 
CAS registry no. (rn): 
sequence LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
COMMENT: 

Contact: MGC help 



GENBANK. RTM. COPYRIGHT 2004 on STN 

BC048162 GenBank (R) 
BC048162 

BC048162.1 gi: 29145031 

503766-79-8 

3672 

mRNA; linear 

Rodents 

21 Oct 2003 

Mus musculus unc-5 homolog B (C. elegans), mRNA (cDNA 
clone IMAGE: 6417563), partial cds. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
sciurognathi ; Muridae; Murinae; Mus 
: 783 a 1137 c 1074 g 678 t 

desk 



Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 

cDNA Library Preparation: M. Bento Soares, University of Iowa 

CDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA sequencing by: University of Iowa, Dr. M. Bento soares and Dr. 

Thomas L. Casavant. 

Web site: http://genome.uiowa.edu 

Contact: bento-soares@uiowa.edu; tom-casavant@uiowa.edu 
Bonaldo,M.F. , Akabogu,i., Bair,T., Bair,J., Crouch,K., Davis, A., 
Fishier, K., Keppel,o, Kucaba,T., Lebeck,M., Melo.A., schaefer.K., 
Scheetz,T., Smith, c. , Snir,E., Tack,D., Trout, K., Walters, 3., 
casavant, T. , soares, M.B. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: Plate: Row: Column: 0 



This clone was selected for full length sequencing because it 

following selection criteria: matched mRNA gi : 21218439 



passed the 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 

JOURNAL (SO): 

OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



(bases 1 to 3672) 
strausberg,R.L. ; Feingold,E.A. ; Grouse, L.H.; 
Derge,J.G.; Klausner ,R.D. ; Collins, F. S. ; Wagner, L.; 
shenmen,C.M. ; schuler ,G.D. ; Al tschul , s . F. ; Zeeberg,B.; 
Buetow,K.H.; schaefer,C.F. ; Bhat,N.K.; Hopkins, R.F. ; 
Jordan, H.; Moore, T. ; Max,s.l.; Wang, J.; Hsieh,F.; 
Diatchenko, L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 
Hong,L.; stapleton,M. ; soares, M.B.; Bonaldo,M. F. ; 
Casavant, T. L. ; Scheetz,T.E. ; Brownstei n , M . J . ; 
Usdin,T.B.; Toshiyuki ,S . ; Carninci,p.; Prange,c.; 
Raha,S.S.; Loquellano,N.A. ; Peters, G. J.; Abramson , R.D. ; 
Mullahy,s.J.; Bosak,S.A.; McEwan,P.j.; McKernan , K . J . ; 
Malek,J.A.; Gunaratne,P.H. ; Richards, s.; Worley,K.C; 
Hale,S.; Garcia, A.M.; Gay,L.J.; Hulyk,S.W.; 
villalon,D.K. ; Muzny,D.M. ; sodergren , E . J . ; Lu,x.; 
Gibbs,R.A.; Fahey,J.; Helton, E.; Ketteman,M.; Madan.A.; 
Rodrigues,s. ; Sanchez , A. ; whiting, M.; Madan,A.; 
Young , A . c . ; shevchenko , Y . ; Bouf f ard , G . G . ; 
Blakesley,R.w. ; Touchman, J .w. ; Green, E.D.; 
Dickson, M. c. ; Rodriguez, A. c. ; Grimwood,J.; Schmutz,J.; 
Myers, R.M.; Butterfield,Y.S. ; Krzywinski ,M.I. ; 
Skalska,u.; Smailus ,d.e. ; Schnerch,A.; Schein,J.E.; 
Jones, s. J.; Marra,M.A. 

Generation and initial analysis of more than 15,000 

full-length human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 
2 (bases 1 to 3672) 
Strausberg, R. 
Direct submission 

Submitted (06-MAR-2003) National Institutes of Health, 
Mammalian Gene Collection (MGC), cancer Genomics 
office, National cancer Institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT): 

Feature Key Location 
===============+============= 

source 1..3672 



Qualifier 



gene 



CDS 



<1. .3672 



<1. .3028 



/orqanism="Mus musculus" 

/moT-type= ,, mRNA ,, 

/strain="C57BL/6" 

/db-xref="taxon : 10090" 

/cl one=" IMAGE : 6417563" 

/tissue-type="Brain, enriched 

mouse brain 12.5dpc" 

/cl one-1 i b= M NlH-BMAP-FlO M 

/lab-host= M DH10B M 

/note="Vector: pYX-ASC" 

/gene="unc5b" 

/db-xref="LocusID: 107449" 

/db-xref="MGl: 894703" 

/gene="unc5b" 

/codon-start=2 

/product="unc5b protein" 

/protei n-i d="AAH48162 . 1" 

/db-xref="Gl:29145032" 

/db-xref=" LocusID : 107449" 

/db-xref="MGI:894703" 



misc-feature 698. .955 



misc-feature 968. .1123 



misc-feature 1136. .1282 



misc-feature 1811. .2122 



misc-feature 2744. .3016 



/transl ation= u LREPGSPAASDSRSAPSRRA 

ASRRCAVASAPTEGTGWRCRAPAR 

TARRRRRRRRRGWGPGKNPEGEAPGPGDRSMRAR 

SGVRSALLLALLLCWDPTPSLAGV 

DSAGQVLPDSYPSAPAEQLPYFLLEPQDAYIVKN 

KPVELHCRAFPATQIYFKCNGEWV 

SQNDHVTQESLDEATGLRVREVQIEVSRQQVEEL 

FG L E D YWCQC VAWS S S GTT K S R R A 

YIRIAYLRKNFDQEPLAKEVPLDHEVLLQCRPPE 

GVPVAEVEWLKN EDVID PAQDTN F 

LLTIDHNLIIRQARLSDTANYTCVAKNIVAKRRS 

TTATVIVYVNGGWSSWAEWSPCSN 

RCGRGWQKRTRTCTNPAPLNGGAFCEGQAFQKTA 

CTTVCPVDGAWTEWSKWSACSTEC 

AHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGL 

CVLTLETSGDVALYAGLWAVFW 

VAVLMAVGVIVYRRNCRDFDTDITDSSAALTGGF 

HPVNFKTARPNNPQLLHPSAPPDL 

TASAGIYRGPVYALQDSADKIPMTNSPLLDPLPS 

LKIKVYNSSTIGSGSGLADGADLL 

GVLPPGTYPGDFSRDTHFLHLRSASLGSQHLLGL 

PRDPSSSVSGTFGCLGGRLSLPGT 

GVSLLVPNGAIPQGKFYDLYLHINKAESTLPLSE 

GSQTVLSPSVTCGPTGLLLCRPW 

LTVPHCAEVIAGDWIFQLKTQAHQGHWEEWTLD 

EETLNTPCYCQLEAKSCHILLDQL 

GTYVFMGESYSRSAVKRLQLAIFAPALCTSLEYS 

LRVYCLEDTPVALKEVLELERTLG 

GYLVEEPKPLLFKDSYHNLRLSLHDIPHAHWRSK 

LLAKYQEIPFYHVWNGSQRALHCT 

FTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTT 

LAETPAGS LDALCSAPGNAITTQL 

GPYAFKIPLSIRQKICSSLDAPNSRGNDWRLLAQ 

KLSMDRYLNYFATKASPTGVILDL 

WEARQQDDGDLNSLASALEEMGKSEMLVAMATDG 

DC* 1 

/gene="Unc5b" 

/note="lG; Region: immunoglobul in" 

/db-xref="CDD:smart00409" 

/gene="Unc5b" 

/note="TSPl; Region: 

Thrombospondin type 1 repeats" 

/db-xref="CDD:smart00209" 

/gene="unc5b" 

/note="TSPl; Region: 

Thrombospondin type 1 repeats 1 ' 

/db-xref="CDD : smart00209" 

/gene= n Unc5b" 

/note="ZU5; Region: Domain present 
in ZO-1 and Unc5-like netrin 
receptors" 

/db-xref="CDD:smart00218" 
/gene="Unc5b" 

/note="DEATH ; Region: DEATH 
domain, found in proteins involved 
in cell death (apoptosis). 
Alpha-helical domain present in a 
variety of proteins with apoptotic 
functions. Some (but not all; of 
these domains form homotypic and 
heterotypic dimers" 
/db-xref="CDD:smart00005" 



SEQUENCE (SEQ): 

1 gctgagagag 
61 ggcctcccgg 
121 ggcgcctgcg 
181 aaagaacccc 
241 ggtgcggagc 
301 aggcgttgac 
361 gctgccgtac 
421 actgcactgc 
481 cagccagaat 
541 agaggtgcag 
601 ctggtgccag 
661 ccgcattgcc 



ccgggatccc 
cgctgcgcgg 
agaacggcga 
gagggagagg 
gcgctgctgc 
tctgctggcc 
ttcctattgg 
agagccttcc 
gaccacgtca 
atcgaggtgt 
tgcgtggcct 
tacttgcgca 



cagcggcgtc 
ttgcctctgc 
ggcggcggcg 
cgcccgggcc 
tggcgctgct 
aggtgctccc 
agccacagga 
ctgccacgca 
cacaggagag 
cacggcagca 
ggagctcttc 
agaactttga 



cgactcccgg 
gcctacggag 
aaggcgaagg 
gggggacagg 
gctttgctgg 
agactcctac 
cgcctacatc 
gatctacttc 
cctggatgag 
a 9tggaggaa 
gggaactacc 
ccaggagcct 



agcgctccta 
ggcacgggct 
cggcgaggct 
agcatgaggg 
gatccgacac 
ccatcagccc 
gtaaagaaca 
aagtgtaatg 
gccacaggct 
ctcttcgggc 
aagagtcgcc 
ctggccaagg 



gtcgccgggc 
ggcgctgccg 

ggggaccggg 

cccggagcgg 
cgagcctagc 
ctgcggagca 
agccagtgga 
gcgagtgggt 
tgcgggtgcg 
tcgaggacta 
gagcctacat 
aggtaccctt 



721 ggatcatgag gtccttctgc agtgccgccc accggaggga gtgcctgtgg ctgaggtgga 
781 atggctcaag aatgaagatg tcattgaccc cgctcaggac actaacttcc tgctcaccat 
841 tgaccacaac ctcatcatcc gccaggcgcg cctctcagac acggccaact acacctgtgt 

901 ggccaagaat atcgtggcca agcgccggag caccacggcc acagtcatcg tctatgtgaa 

961 tggaggctgg tccagctggg cagagtggtc accctgttcc aatcgctgtg gccgaggctg 

1021 gcagaagcgt actcggacct gcaccaatcc agccccactc aatggaggcg ccttctgtga 

1081 gggacaggcc ttccagaaga cagcttgcac caccgtgtgc ccagtggatg gagcgtggac 
1141 cgagtggagc aagtggtctg cctgcagcac agagtgtgcg cactggcgca gccgcgagtg 

1201 catggcaccg ccaccccaga acggaggccg tgactgcagc gggacgctac ttgactccaa 
1261 gaactgcact gatgggctgt gcgtgctgac cctggagaca tcgggagatg tggcactgta 

1321 cgcaggcctt gtggtggccg tctttgtggt ggtagcggtt ctcatggccg tgggagtgat 

1381 cgtataccgg agaaactgcc gggacttcga cacggacatc accgactcct ctgcggccct 

1441 cactggtggc ttccaccctg tcaacttcaa gactgcaagg cccaacaacc cgcagctcct 

1501 gcacccgtcc gcccctccag acctaacggc cagtgctggc atctaccgcg ggcctgtgta 

1561 tgccctgcag gactccgccg acaagatccc catgactaat tcgcccctgc tggatcccct 

1621 gcccagcctc aagatcaagg tctataactc cagcaccatc ggttctgggt ctggcctggc 

1681 tgatggagcc gacctgctgg gtgtcctccc gccgggcacg tacccaggcg atttctcccg 

1741 ggacacccat ttcctgcacc tgcgcagtgc cagccttggt tcccagcacc tcctgggcct 

1801 acctcgggac cccagcagca gtgtcagcgg cacctttggt tgcctgggag gaaggctgag 
1861 cctccccggc acaggggtca gcctgttggt accaaatgga gccattcccc agggcaagtt 

1921 ctatgacctg tatctacata tcaacaaggc cgaaagcacc ctcccacttt cagaaggttc 
1981 ccagacagta ttgagcccct cggtgacctg tgggcccaca ggcctactcc tgtgccgccc 

2041 tgtcgtcctc accgtgcccc actgtgctga agtcatcgct ggagactgga tctttcagct 

2101 caagacccag gcccatcagg gccactggga ggaggtggtg accttggatg aggagaccct 

2161 caacacaccc tgctactgcc agctggaggc taagtcctgc cacatcctgc tggaccagct 

2221 gggtacctac gtattcatgg gcgagtccta ctctcgctct gcagtcaagc ggctccagct 

2281 ggccatcttc gccccagccc tctgcacctc cctggagtat agcctcaggg tctactgtct 

2341 ggaggacaca cctgtagcac tgaaggaggt cctggagctg gagaggactc tgggtggcta 

2401 cttggtggag gagcccaagc ctttgctctt taaggacagt taccacaacc tacgcctctc 

2461 cctccatgac atcccccatg cccactggag gagcaaacta ctggccaagt accaggagat 

2521 tcccttctac cacgtctgga atggcagcca gagagccctg cactgcactt tcaccctgga 

2581 gaggcatagc ctggcctcca cggagttcac ctgtaaggtc tgcgtgcggc aggtcgaagg 

2641 ggaaggccag attttccagc tgcacacaac gttggccgag acgcctgctg gctccctgga 

2701 tgctctctgc tctgccccgg gcaatgccat caccacccag ctgggaccct atgccttcaa 

2761 gatacccctg tccatccgcc aaaagatctg cagcagcctg gacgccccca actcccgggg 

2821 caacgactgg aggctgttgg cgcagaagct gtccatggac cggtacctaa actacttcgc 

2881 caccaaagct agtcccacag gtgtcatctt agacctctgg gaagctcggc aacaggatga 

2941 cggggacctc aacagcctgg ccagtgcctt ggaggagatg ggcaagagtg agatgctggt 

3001 agccatggcc acagatggcg attgctgagt gcctgtgacc acaggcctgt ggggatcagt 

3061 aggagacggt gcaaggaggc ctggcagcct ctgcacaggg gtgcccagcc tccaccactc 

3121 ctggctcaca gcaggaatgg tccttcaact ccctccccgc cacaaccctc agaccaccac 

3181 caccagcctt agaaagtctc tgtgctctac tgccaagagg ccgggatcct ctggcccact 

3241 gtttccccag ctcactctgg ggtgggctga ggcctctggg acagctgaaa gccagaggct 

3301 ttcccctgcg acaacacacc accctcagcc ctgtgacttt ggggacccac aggtttcaat 

3361 tctgtgttca catggtcctg ggctagggac cgctctctta tcccgggtcg agttcagttc 

3421 aggcaaactg ctttttcctg tccacaagca gagagggaag attaggggag tgggggtggg 

3481 gggtggggga tgagcctcag aagtcagcga gactcaggta gtgagagagc aaaaacagta 

3541 agggcaaaga aagacccagt ttcttaggga acgcaaatga tttattatcc agatacttgg 

3601 atagttcctt tttaagaaaa caaaacaaac aaaaaaaagt aaaaaaaaaa aaaaaaaaaa 
3661 aaaaaaaaaa aa 
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locus (LOC): BC041156 GenBank (R) 

GenBank ACC. NO. (GBN) : BC041156 
GenBank version (VER) 

CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (ci) : mRNA; linear 
DIVISION CODE (CI): Primates 
DATE (DATE) : 21 Oct 2003 

DEFINITION (DEF): Homo sapiens unc-5 homolog c (C. elegans), mRNA (cDNA 

clone MGC: 48696 IMAGE : 5208108) , complete cds. 
KEYWORDS (ST): MGC 
SOURCE: Homo sapiens (human) 

ORGANISM (ORGN): Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 577 a 569 c 585 q 539 t 
COMMENT: 

Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.a.G.e. consortium (LLNL) 



BC041156.1 Gl: 27370704 
492985-83-8 
2270 



DNA Sequencing by: Institute for Systems Biology 
http : //www . systemsbi ol oqy . org 
contact : amadanOsystemsbi ol ogy . o rg 

Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha 
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting 
clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
series: IRAK Plate: 84 Row: d Column: 5 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 16933524. 



REFERENCE: 

AUTHOR (AU): 



TITLE (TI) : 

JOURNAL (SO): 

OTHER SOURCE 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO): 



(OS) 



1 (bases 1 to 2270) 
strausberg,R.L. ; Feingold.E.A. ; Grouse, L.H.; 
Derge,J.G.; Klausner ,R.D. ; Collins, F. S. ; Wagner, L.; 
shenmen,C.M. ; Schuler ,G.D. ; Altschul ,s . F. ; Zeeberg,B.; 
Buetow,K.H.; Schaefer.C.F. ; Bhat,N.K.; Hopkins ,R. F. ; 
Jordan, H.; Moore, T.; Max,s.l.; Wang, J.; Hsieh,F.; 
Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 
Hong,L.; stapleton,M. ; Soares,M.B.; Bonaldo ,M. F. ; 
Casavant,T.L. ; scheetz,T.E. ; Brownstei n , M . J . ; 
Usdin,T.B.; Toshiyuki ,s. ; Carninci,p.; Prange,C; 
Raha,s.s.; Loquellano,N.A. ; Peters, G. J.; Abramson,R.D. 
Mul 1 ahy , s . J . ; Bosak.S.A.; McEwan,P.J.; McKernan , K . J . ; 
Malek,J.A.; Gunaratne,P.H. ; Richards, s.; Worley,K.C; 
Hale,s.; Garcia, A.M.; Gay,L.J.; Hulyk,S.W.; 
villalon,D.K. ; Muzny , D . M . ; sodergren , E . J . ; Lu,X.; 
Gibbs,R.A.; Fahey, J.; Helton, E.; Ketteman, M.; Madan, A. 
Rodrigues, s. ; Sanchez, A.; whiting, M.; Madan, A.; 
Young, A. c; Shevchenko, Y. ; Bouffard ,G.G. ; 
Blakesley.R.W. ; Touchman, J .W. ; Green, E.D.; 

Dickson, M. c. ; Rodriguez, A. C. ; Grimwood,J.; schmutz,J.; 
Myers ,R.M.; Butterfield,Y.S. ; Krzywinski ,M. I. ; 
Skalska,u.; smailus,D.E. ; Schnerch,A.; Schein,J.E.; 
Jones, s. J.; Marra,M.A. 

Generation and initial analysis of more than 15,000 

full-length human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 

2 (bases 1 to 2270) 
Strausberg,R. 
Direct Submission 

Submitted (16-DEC-2002) National Institutes of Health, 
Mammalian Gene Collection (MGC) , cancer Genomics 
office, National Cancer institute, 31 Center Drive, 
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FEATURES (FEAT): 

Feature Key Location 

source 1..2270 



gene 



CDS 



1. .2270 



350. .2143 



Qualifier 

/organism="Homo sapiens" 

/mol-type="mRNA" 

/db-xref="taxon:9606" 

/cl one="MGC : 48696 IMAGE : 5208108" 

/tissue-type="Lung, spleen, fetal, 

pooled" 

/clone-lib="NIH-MGC-122" 

/lab-host="DHl0B" 

/note="Vector: pCMV-SP0RT6" 

/gene="UNC5c" 

/note="synonym: UNC5H3" 

/db-xref="LocusID:8633" 

/db-xref="MIM: 603610" 

/codon-start=l 

/product="UNC5C protei n" 

/protei n-i d="AAH41156 . 1" 

/db-xref="GI:27370705" 

/db-xref ="LocusID : 8633" 

/trans! ation="MRKGLRATAARCGLGLGYLL 

QM LV L PA L A L L S AS GTGS AAQDDD 

FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKP 

VN L YCK AS PATQI Y FKCN S EWVHQ 

KDHIVDERVDETSGLIVREVSIEISRQQVEELFG 

P EDYWCQCVAWS S AGTTKS RKAYV 

R I AY LRKTFEQEPLGK E VS LEQEVLLQCRPPEGI 

PVAEVEWLKNEDIIDPVEDRNFYI 

IIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTT 



misc-feature 866. .1123 
misc-feature 1136. .1291 

misc-feature 1304. .1450 



misc-feature 



1988. .2140 



ATVIVYVNGGWSTWTEWSVCNSRC 

GRGYQK RT RTCTN PA P LN GGA FCEGQS VQK I ACT 

TLCPVDGRWTPWSKWSTCGTECTH 

WRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCM 

QS FIYPISTEQRTQNEYGFSSAPD 

SDDVALYVGIVIAVIVCLAISVWALFVYRKNHR 

DFESDIIDSSALNGGFQPVNIKAA 

RQDLLAVPPDLTSAAAMYRGPVYALHDVSDKIPM 

TNSPILDPLPNLKIKVYNTSGAVT 

PQDDLSEFTSKLS PQMTQS L L E N EA LS LKNQS L A 

RQTDPSCTAFGSFNSLGGHLIVPN 

SGVSLLIPAGAIPQGRVYEMYVTVHRKETMR" 

/note="iG ; Regi on : immunogl obul in" 

/db-xref="CDD:smart00409" 

/note="TSPl; Region: 

Thrombospondin type 1 repeats" 

/db-xref ="CDD : smart00209" 

/note="TSPl; Region: 

Thrombospondin type 1 repeats" 

/db-xref="CDD:smart00209" 

/note="zu5; Region: Domain present 

in zo-1 and Unc5-like netrin 

receptors" 

/db-xref="CDD:smart00218" 



SEQUENCE (SEQ): 

1 cagacttcgg 
61 ggccgccttg 
121 gtcagttaga 
181 tccccctttt 
241 cttcggactg 
301 gcgcgtctct 
361 tctgcgggcg 
421 gctacctgcc 
481 cttttttcat 
541 ccttattgag 
601 agcaagccct 
661 ccacatagta 
721 tgagatttcg 
781 tgtggcctgg 
841 tctacggaag 
901 cttactccag 
961 tgaagacata 
1021 catcataaag 
1081 tgttgccaag 
1141 cacctggacg 
1201 aaggacttgt 
1261 gcagaaaata 
1321 gtggtctact 
1381 ccccaagaat 
1441 tgggctttgc 
1501 atatggattt 
1561 agcagtgatc 
1621 tcatcgtgac 
1681 tgtgaacatc 
1741 tgcagccatg 
1801 gaccaactct 
1861 aggtgctgtc 
1921 gacccagtcg 
1981 gactgatcca 
2041 tcccaattca 
2101 cgaaatgtat 
2161 cactgacttt 
2221 aaaaaaaaaa 



cgtgcggcgg 
cgcctccctg 
aggcattggg 
ggcccctgcc 
cttcgcggtg 
ctgtccccct 
acagcggccc 
ctggccctgc 
gaactcccag 
cctgaagaag 
gccacccaga 
gatgaaagag 
cgccagcaag 
agctccgcgg 
acatttgagc 
tgtcgaccac 
attgatcccg 
caggcccgac 
aggaaaagta 
gagtggtctg 
accaacccgg 
gcctgtacta 
tgtggaactg 
ggaggcaagg 
atgcagagtt 
tcttctgctc 
gtttgcctgg 
tttgagtcag 
aaggcagcaa 
tacagaggac 
ccaattctgg 
accccccaag 
ttgttggaga 
tcctgtaccg 
ggagtcagct 
gtgactgtac 
accaatgtct 
aaaaaaaaaa 



agctgcggct 
gacttcccaa 
aaagagggac 
tttggagaaa 
cacggattca 
cccctccccc 
gctgcggact 
tcagcgccag 
aaacttttcc 
cttatattgt 
tctatttcaa 
tagatgaaac 
tggaagaact 
gtaccacaaa 
aggaacccct 
ctgaagggat 
ttgaagatcg 
tctctgatac 
caactgccac 
tgtgtaacag 
caccactcaa 
cgttatgccc 
agtgcaccca 
actgcgacgg 
tcatttatcc 
ctgattcaga 
cgatctctgt 
atattattga 
gacaagatct 
ctgtctatgc 
atccactgcc 
atgacctctc 
atgaagccct 
catttggcag 
tgctgattcc 
acaggaaaga 
aacttggagg 
aaaaaaaaaa 



gctcctgctg 

gtgggggcag 

cttcaaactc 
gtggagtgtg 
gcttctgccc 
cggcacacct 
gggactggga 
cggcactggc 
ttctgatcca 
gaagaataag 
gtgtaatagc 
ttccggtctc 
ctttggacct 
gagccggaag 
aggaaaggaa 
cccagtggct 
gaatttttat 
tgcaaattac 
tgtcatagtc 
ccgctgtgga 

tgggggtgcc 

agtggatggc 
ctggcgcagg 
cctcgtcttg 
catttcaact 
tgatgttgct 
agttgtggcc 
ctcttcggca 
gctggctgta 
cctgcatgac 
caacctgaaa 
tgagtttacg 
cagcctgaag 
cttcaactcg 
cgctggggcc 
aactatgagg 
agaattgtac 
aaaaaaaaaa 



ctggggggag 
agcagtcgct 
ctcctcggcg 
gcgcttggtt 
agtggggctt 
ctgtctacga 
tacttgctgc 
tccgccgccc 
cctgagcctc 
cctgtgaacc 
gaatgggttc 
attgtccggg 
gaagattact 
gcgtatgtgc 
gtgtctttgg 
gaggtggaat 
attattattg 
acctgtgttg 
tatgtcaacg 
cgagggtatc 
ttctgtgaag 

a ggtggacgc 

agggagtgca 
caatccaaga 
gaacagagaa 
ctctatgttg 
ttgtttgtgt 
ctcaatgggg 
cccccagacc 
gtctcagaca 
atcaaagtgt 
tccaagctgt 
aaccagagtc 
ctgggaggtc 
attccccaag 
taaatatgct 
aatagacaca 
aaaaaaaaaa 



cggcggaggt 
cgcgggagac 
tctcctctcc 
gtcgttattt 
tcagctgttt 
tgaggaaagg 
aaatgctcgt 
aagatgatga 
tgccacattt 
tgtactgtaa 
atcagaagga 
aagtgagcat 
ggtgccagtg 
gcattgcata 
aacaggaagt 
ggttgaaaaa 
atcacaacct 
ccaaaaacat 
gtggctggtc 
agaaacgtac 
ggcagagtgt 
catggagcaa 
cggcgccagc 
actgcactga 
cccagaatga 
ggattgtgat 
atcggaagaa 
gctttcagcc 
tcacgtcagc 
aaatcccaat 
acaacacctc 
cccctcagat 
tagcaaggca 
accttattgt 
ggagagtcta 
ttttctggtg 
ataaacccaa 



L2 ANSWER 94 OF 104 



LOCUS (LOC): AK031655 
GenBank ACC. NO. (GBN) : AK031655 



GenBank version (ver) 
CAS registry no. (rn) 

SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 
GenBank (R) 
Gl:26327502 



AK031655.1 
486389-66-6 
3790 

mRNA; linear 

High-Throughput CDNA sequencing 
3 Apr 2004 

Mus musculus 13 days embryo male testis cDNA, RIKEN 



KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN) 



full-length enriched library, clone :6030473H24 product: 

***unc5*** homolog (c. elegans) 3, full insert 
sequence. 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; craniata; vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
gsc. riken. jp/. 
1 

Carninci, P.; Hayashizaki , Y. 
High-efficiency full -length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci, P.; shibata,Y.; Hayatsu,N.; Sugahara,Y. ; 
shibata,K.; Itoh,M.; Konno.H.; Okazaki,Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 
cDNAs to prepare full-length cDNA libraries for rapid 
discovery of new genes 
Genome Res., 10 (10), 1617-1630 (2000) 
CA 134:305920 
3 

ltoh,M.; Aizawa,K.; Nagaoka,S.; Sasaki, N. 
Konno,H.; Akiyama,J.; Nishi,K.; 
Tashiro,H.; ltoh,M.; Sumi,N.; lshii,Y.; 
Hazama,M.; Nishine,T.; Harada,A.; 
Matsumoto,H. ; Sakaguchi ,S. ; Ikegami ,T. ; 
Fujiwake,S.; Inoue,K.; Togawa,Y.; 
lzawa,M.; Ohara,E.; Watahiki.M.; Yoneda,Y.; 
Ishikawa,T.; Ozawa,K.; Tanaka,T.; Matsuura,S.; 



URL: http ://fantom 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 

OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



Shibata, K. ; 
Carninci , P. 
Kitsunai ,T. ; 
Nakamura,s. ; 
Yamamoto, R. ; 
Kashiwagi ,K. 



Muramatsu,M. ; lnoue,Y.; Kira,A. 



TITLE (TI): 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE 
REFERENCE: 

AUTHOR (AU): 



Kawai , J . ; okazaki ,Y. 
Hayashizaki ,Y. 
RIKEN integrated sequence analysis (RISA) 
system — 384-format sequencing pipeline with 384 
multi capillary sequencer 
Genome Res., 10 (11), 1757-1771 (2000) 
4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 
(OS): CA 134:203311 
5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 
annotation of 60,770 full-length cDNAs 
Nature, 420, 563-573 (2002) 
(OS): CA 138:131939 

6 (bases 1 to 3790) 

Adachi,J.; Aizawa,K. ; Akimura,T.; Arakawa,T.; Bono,H.; 
Carninci, P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume ,w. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka.T.; Hirozane.T.; Hori,F.; 
lmotani,K.; lshii,Y.; Itoh,M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai, J.; Kojima,Y. ; Kondo,S.; Konno,H.; 
Kouda,M.; Koya,S.; Kurihara.c. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K.; Numazaki,R.; ohno,M.; ohsato,N.; okazaki, Y.; 
Saito,R.; Saitoh, H.; sakai,c; sakai,K.; sakazume,N.; 
Sano,H.; Sasaki, D.; shibata, K.; shinagawa,A. ; 
Shiraki,T.; Sogabe,Y.; Tagami,M.; Tagawa,A.; 
Takahashi , F. ; Takaku-Akahira,s. ; Takeda,Y. ; Tanaka,T. ; 
Tomaru,A.; Toya,T.; Yasunishi ,A. ; Muramatsu ,M. ; 



TITLE (TI): 
JOURNAL (SO): 



Hayashizaki ,Y. 
Direct Submission 

submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic sciences Center (GSC) , RIKEN Yokohama 
institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail igenome- 
res@gsc. riken.jp, URL: http://genome.gsc. riken.jp/, 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT): 
Feature Key 

source 1..3790 



CDS 



118. .2970 



Locati on Qual i f i e r 

/orqanism="Mus musculus" 
/mol-type^'mRNA" 
/strain="c57BL/6J" 
/db-xref="FANTOM-DB:6030473H24 M 
/db-xref= M taxon : 10090" 
/clone= M 6030473H24 n 
/sex="male M 
/tissue-type="testis" 
/clone-lib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="13 days embryo" 
/note="unnamed protein product; 
putative unc5 homolog (c. elegans) 
3 (MGD | MGI: 1095412, GB I NM-009472 , 
evidence: BLASTN, 99%, match=464)" 
/codon-start=l 
/protein-id="BAC27495.1" 
/db-xref="Gl: 26327503" 
/trans! ation="MRKGLRATAARCGLGIGYLL 
QM L V L PA L A L LS AS GTGS AAQDD E 
FFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKP 
VN LYCKAS PATQI YFKCNS EWVHQ 
KDHWDERVDETSGLIVREVSIEISRQQVEELFG 
P E D YWCQCVAWS S AGTTK S R K A YV 
RIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGI 
PVAEVEWLKNEDIIDPAEDRNFYI 
TIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTT 
ATVIVYVNGGWSTWTEWSVCNSRC 
GRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACT 
TLCPVDGRWTSWSKWSTCGTECTH 
WRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCM 
QGFIYPISTEHRPQNEYGFSSAPD 
SDDVALYVGIVIAVTVCLAITVWALFVYRKNHR 
DFESDIIDSSALNGGFQPVNIKAA 
RQDLLAVPPDLTSAAAMYRGPVYALHDVSDKIPM 
TNSPILDPLPNLKIKVYNSSGAVT 
PQDD L AE FS S K LS PQMTQS L LEN EALN LKNQS LA 
RQTDPSCTAFGTFNSLGGHLIIPN 
SGVS L L I PAGAI PQGRVYEM YVTVH RK ENM R P PM 
EDSQTLLTPWSCGPPGALLTRPV 
ILTLHHCADPSTEDWKIQLKNQAVQGQWEDWW 
GE E N FTT PC YIQ LDAE ACH I LTE N 
LSTYALVGQSTTKAAAKRLKLAIFGPLCCSSLEY 
SIRVYCLDDTQDALKEVLQLERQM 
GGQLLEEPKALRFKGSIHNLRLSIHDIAHSLWKS 
KLLAKYQEIPFYHIWSGSQRNLHC 
TFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNC 
TVSEEPTGIDLPLLDPASTITTVT 
GPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAH 
KLNLDRYLNYFATKSSPTGVILDL 
WEAQNFPDGNLSMLAAVLEEMGRHETWYLAAEG 
QY" 



SEQUENCE (SEQ): 

1 tggttatttc 
61 tcggctgttt 
121 aggaaaggtc 
181 atgcttgtgt 
241 gatgatgaat 
301 ccacacttcc 
361 tattgtaaag 
421 cagaaggacc 



tcaggactgc 
gcgcgtctcc 
tgagggcgac 
tacctgccct 
tttttcacga 
tcattgagcc 
ccagccctgc 
acgtagtaga 



ctggcggtgg 
tggtggcgtt 
agcggcccgc 
ggccctgcta 
actcccagaa 
cgaggaagct 
cacccagatc 
cgagagagta 



ccggatccag 
tcccttcccc 
tgcggactgg 
agcgccagtg 
acctttccat 
tacattgtga 
tacttcaagt 
gatgaaacct 



cctcctgcct 
gtaaacctct 
gaataggata 
gcaccggctc 
ctgacccacc 
agaacaagcc 
gcaacagcga 
ctggtctaat 



ggctgggctt 
gccgacgatg 
cttgctgcag 
cgccgctcaa 
tgagccattg 
tgtgaacctg 
gtgggttcat 
tgtgagagaa 



481 gtgagcattg agatttcacg ccagcaggtg gaggaactgt ttgggcctga agattactgg 
541 tgccagtgtg tggcctggag ctcagcaggc actacgaaga gtcggaaggc atacgtgcgc 
601 attgcgtatc tgcggaagac attcgagcag gaacccttgg gaaaggaagt gtccttggag 
661 caggaagtct tactccagtg tcggccacct gaagggatcc cagtggctga ggtagaatgg 
721 ctaaagaatg aagacataat tgatcctgct gaagatcgga acttttatat tactatcgat 
781 cacaacctga tcatcaagca agcccgactc tcagatacag caaattatac ctgtgttgcc 
841 aaaaatattg ttgccaagag aaaaagcacc acagccactg tcatcgtgta tgttaatggt 
901 ggctggtcca cctggacaga gtggtctgtg tgtaacagcc gctgtgggcg aggatatcag 
961 aaacgcacaa gaacctgcac caacccagcc ccactcaatg gtggggcctt ctgtgagggg 
1021 cagagtgtgc agaaaatagc atgcactacg ttatgtccag tggatggtag gtggacttca 
1081 tggagcaaat ggtcaacctg tgggactgaa tgcacccact ggcgcaggag ggagtgtaca 
1141 gcaccagccc ccaagaacgg gggtaaggac tgtgatggcc tggtcctcca atccaagaac 
1201 tgcactgatg ggctgtgcat gcagggattc atttacccca tttcaactga gcacagaccc 
1261 cagaatgaat atggattttc ttctgctcct gactcagatg atgtggctct ctacgtgggg 
1321 attgtgatcg ctgtaacagt ctgtctggcg atcactgttg tggtggccct gtttgtgtat 
1381 cggaagaacc accgtgactt tgagtctgac atcattgact cctcagcact caatggcggc 
1441 tttcagcctg tgaacatcaa ggctgccaga caagatctcc tggctgtccc ccctgacctc 
1501 acctcagctg cagccatgta caggggacct gtctatgctc tgcatgatgt ctcagacaaa 
1561 atcccaatga ccaactctcc aattctggac ccactaccca acttgaaaat caaagtgtac 
1621 aacagctcag gtgctgtcac tcctcaggat gaccttgccg agttctcatc caaactgtca 
1681 ccccagatga cccagtcctt gctagagaat gaggccctta acctgaagaa ccagagcctc 
1741 gcaagacaga ctgacccatc ctgcacagca tttggtacct tcaactctct tgggggtcac 
1801 ctcatcattc ctaattcagg agtaagcttg ctgattcccg ctggggccat tcctcagggg 
1861 agagtctatg aaatgtatgt gactgtacac aggaaagaaa atatgaggcc ccccatggaa 
1921 gactctcaga ccctacttac ccctgtggtg agctgtgggc ctcctggagc tctgctgacc 
1981 cgccctgtca tcctcactct gcatcactgt gcagacccca gcaccgagga ctggaagatc 
2041 cagctcaaaa accaggcagt gcagggacaa tgggaggatg ttgtggtggt tggggaggag 
2101 aacttcacaa ccccctgtta cattcagctg gatgcagagg cttgccatat cctcacagag 
2161 aacctcagta cctatgccct ggttgggcag tccaccacca aagcagctgc caagcgtctt 
2221 aaactggcca tctttgggcc cctctgctgc tcttccctgg agtacagcat tagagtctac 
2281 tgcctggatg acacacagga tgccctgaag gaagttctac aactggagag gcaaatggga 
2341 ggacagctcc tagaagaacc caaggctctt cgttttaaag gcagcatcca caacctgcgc 
2401 ctgtctattc atgacatcgc ccattccctc tggaagagca aattgctggc taagtatcag 
2461 gaaattccat tttaccacat ctggagtggc tctcaaagaa acctccactg caccttcact 
2521 ctggaaagac tcagcctaaa cacagtggaa ctggtttgca aactctgtgt gcggcaggtt 
2581 gaaggagaag ggcagatctt ccagctcaac tgtactgtgt cagaggaacc tactggcatc 
2641 gacttacctc tcctggaccc tgctagtacc atcaccactg tcaccggacc aagtgctttc 
2701 agcattcctc tccctatccg gcagaagcta tgcagcagcc tggatgcccc tcaaacaaga 
2761 ggccatgact ggaggatgct ggcccataaa ctcaacctgg acaggtactt gaattacttt 
2821 gccaccaaat cgagcccaac tggcgtaatc ctggatcttt gggaagcaca gaacttccca 
2881 gatggaaacc tgagcatgct ggcagccgtc ctggaagaaa tgggaagaca tgagacagtg 
2941 gtgtacttgg cagcagaagg acagtattga ccacactgga actaaagctg aaggacacaa 
3001 ttacacaggg agtctgtgtt caggggaacc acatctgagg aggaaatcca gataggacca 
3061 aggcgctcta caggcaagat ggcaacagga aacttggggg acagatataa ccaccaaggt 
3121 acacgcccac ttcattcgga cagcaccacc gcgggagtta agaaaaattg tgtaaatttg 
3181 taccttgaat ttaagaatca atctaatttt ctcttcgttg ggctgtatgc tgtatggtac 
3241 aggatcttac agtttcctag gaaacgcttt ttattgctat ccagatgtat ggataaactt 
3301 tcttaacaaa cccagtttct acaaatgttg tttacatcaa attggacagg gatgcagaca 
3361 ctgtccatgg ctcgttctat ttttgttcag atcatttgaa gttgaagctg tggacagttt 
3421 attgtgtcta tttcagatta gtaatttaca gagaaatcac agacttttgc taaaaatcat 
3481 gtacatcaag tgtctcagat aatcttccca tcagtgttct gtttctgaaa cttgttggac 
3541 cagtattggc attggtatca gggaagtgga gaatctaaat gtaaaggaga aattgagaaa 
3601 attccttata tcctggggta accccgttgg tatcctttgg gaacagagct ctagcattac 
3661 aggggaggta gctattcatg ttcctccaca caaacatttc tgtaccacat gtgtgtttgt 
3721 aataagcaat ttcaagtgtt ttttaaaaaa tattatcatt attattgatg attattacaa 
3781 atattctagc 
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LOCUS (LOC): AK048339 GenBank (R) 

GenBank ACC. NO. (GBN) : AK048339 

GenBank VERSION (VER) : AK048339.1 GI: 26092820 

CAS REGISTRY NO. (RN) : 492772-21-1 

SEQUENCE LENGTH (SQL): 2358 

MOLECULE TYPE (Cl) : mRNA; linear 

DIVISION CODE (CI): High-Throughput CDNA Sequencing 
DATE (DATE) : 3 Apr 2004 

definition (DEF): Mus musculus 16 days embryo head cDNA, RIKEN 

full-length enriched library, clone:Cl30050El5 product: 

***unc5*** homolog (C. elegans) 3, full insert 
sequence. 

KEYWORDS (ST): HTC; CAP trapper 

SOURCE: Mus musculus (house mouse) 

organism (ORGN): Mus muscul us 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 

Euteleostomi ; Mammalia; Eutheria; Rodentia; 



sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
URL : http : //fantom . gsc . ri ken . j p/ . 



(OS): 



REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 
OTHER SOURCE 

REFERENCE: 

AUTHOR (AU) : 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 



Carninci,P.; Hayashizaki ,Y. 
High-efficiency full-length cdna cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; shibata,Y.; Hayatsu,N.; sugahara,Y.; 
Shibata,K.; ltoh,M.; Konno,H.; Okazaki,Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka,S.; Sasaki, N.; 
Carninci,P.; Konno,H.; Akiyama,J.; Nishi.K.; 
Kitsunai,T.; Tashiro,H.; ltoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto,R.; Matsumoto.H. ; Sakaguchi ,s. ; lkegami,T.; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K. ; Togawa,Y. ; 
izawa.M.; ohara,E.; Watahiki,M.; Yoneda,Y. ; 
Ishikawa,T.; ozawa,K.; Tanaka,T.; Matsuura,s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; lnoue,Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system--384-format sequencing pipeline with 384 

multicapillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the fantom consortium. 

Functional annotation of a full-length mouse cDNA 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The fantom Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 2358) 

Adachi,J.; Aizawa,K.; Akimura,T. ; Arakawa.T.; Bono,H.; 
Carninci,P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
Imotani,K.; lshii,Y.; ltoh,M.; Kagawa,I.; Kasukawa.T. ; 
Katoh.H.; Kawai,J.; Kojima,Y. ; Kondo.s.; Konno,H.; 
Kouda,M.; Koya,S.; Kunhara,C. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura ,K.; Numazaki.R.; ohno,M.; ohsato,N.; okazaki,Y.; 
Saito,R.; Saitoh,H.; Sakai,C; Sakai.K.; Sakazume,N.; 
Sano,H.; Sasaki, D.; shibata,K.; Shinagawa,A. ; 
Shiraki,T.; Sogabe,Y.; Tagami,M.; Tagawa,A.; 
Takahashi ,F. ; Takaku-Akahi ra,S . ; Takeda,Y.; Tanaka,T. ; 
Tomaru,A.; Toya.T.; Yasunishi ,A. ; Muramatsu ,M. ; 
Hayashizaki ,Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, riken 
Genomic Sciences Center (GSC), RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku f Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- 



res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . i p/ , 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT): 

Feature Key Location 

source 1..2358 



misc-feature 1. .2358 



Qualifier 

/organism="Mus musculus" 
/mol-type^'mRNA" 
/strain="c57BL/6J M 
/db-xref="FANTOM-DB:Cl30050El5" 
/db-xref="taxon : 10090" 
/clone= ,, Cl30050E15 n 
/tissue-type="head" 
/clone-lib="RlKEN full-length 
enriched mouse cDNA library" 
/dev-stage= ,! 16 days embryo ' 
/note="unc5 homolog (c. elegans) 3 
(MGD | MGI: 1095412, GB | NM-009472 , 
evidence: BLASTN, 100%, 
match=239)" 



SEQUENCE (SEQ): 

1 tcctgacttc caagaatgaa agatttttac taggaaaggc aaggggtgtg tgtgtgtgtg 
61 tgtgtgtgtg tgtgtgtgtg tgagtgaatt tttgcctcct tttacgtcac aatgtctctc 
121 ttatctgcca ccactaatat tatgtgtgct aacacctcca agtttagacc agaaaattat 
181 taatatctat tcagagataa ttcccttatt gcataaattt agtgttttta agacaagtaa 
241 cacatttctt tccacaggag aggaaagtac tttcatagct ttatgcatgt tggtgtttca 
301 caaattgaca gtctgttttg ataatagcca aatttgatgg ctactctctc atatatggct 
361 cagctgggag acaaattcat gtattcacta tggagattta caataaagaa aaatagacaa 
421 gttggccttt gttaggcaag taattctgta ttctaattta gagactgatt aaaagtcttc 
481 agagaacaaa ctttgcaagc acatggctaa aaataaatgc tggatcgtgg cagagtgtgt 
541 tggtttatac ctgtaagact cccacttaag taggctaaga catgaggaat gtaaggtcaa 
601 agccagcctg tgtaacaaag ccagatcctg tcagtaaaac aaagaaatca ttacattata 
661 attaacatca gagttaagtg gggaagaatc atttaaagta tccatctttg taccctggga 
721 taggtcatca tgagaatcca tgcaaatata ttggaaacaa ttcccctaag gagatgagag 
781 catcctggga gctgtactag gctaaccttc tttgtgtatg tagtgtaatt gttgagattt 
841 aaatataacc gtccccttat gtgtcccttt tattagcatg aaaagggaga gtgtgctcca 
901 ttaggcctta actgcacata gcacttttat gacttaggtc atggctttgc tggagcagca 
961 gagggagcca ctctaagaag aaacacaatc actgggggct gtagtcattt ctggaagatg 
1021 gtgtggatag aagctgagtc tagactccgg ccttgaagtt ttagcctaga tcccttttac 
1081 ttgacctttg catctgtgca acttaaaaac caatctgttc tcatttcctg caaactacat 
1141 ttttgagctt atttgaacag agtgtgtacg agaagtagta tgccagaggg aacaagtccc 
1201 agactgtttg ctgcttgttg gagagttaga cttgaataaa ctttatcatc gggatagatc 
1261 taccagcata ttaatttctg aactgtaacc ctgtatacag atatgttcct tgtgtcaagg 
1321 tggccaggat tatcccatga gtggtagtgg taatggagac agttccaaaa attttgtaca 
1381 tagaagatta atggtatttt atacatacaa gattaattgg ttggaagtag taaggaaggg 
1441 acacttggaa gttgaactag tccaggcaca tactctcatt cattcattca tcccaccatc 
1501 tctcatgctg gctgtgaact catgagccac ctacccacct tggcccctac ccccatttgt 
1561 ttctgggatt acaactgtgt acccccatgc ccatgatgga agtgggcttt tcttgaagtt 
1621 aaatttgcat ttcaaaagta ttacttatat tcagatgcaa cccaccccca cccccagcct 
1681 gaaatctggc aggaagagaa catcaggggt agagcctgcc accctatcgt tctgccactc 
1741 ccactgctgc tgcctgccgc ctagctgttc tggagctatc atgtggtcac ctgaaactgg 
1801 actccaagga tgatttggtg ggaatgggcc ccttctcctt cttcataacc cggtgtctcg 
1861 gaatagtaaa atcgaacctt gatcagactg atgtcttggt tccgttcttt cccgagtcac 
1921 ctagttccct cttctcttcc agattccaag atgcctcccg ggctagaacc cagacatgtg 
1981 ggccgctggc cggcctcaac attcagatac ctcccctact ttggtaatgg ggattgaatc 
2041 taggtacctg tacatgttat gatagcaagt gctatgctat gactctatgg ccccatgcct 
2101 actacttaga gtcagtggct attttgtctg gctttgggga ggtacaagct aataacagtg 
2161 aataccatgt gcctgtttta gttagcacaa tgcatcataa ttcctcggac acatcctagc 
III} tatat g act 9 tgatatgcca agttatttga ctattctgta cctcaatttc cccacatgta 
2281 aaatgagaaa aatatatacc ataggtcata ctaaggctaa attagtgtat gtaaaccatc 
2341 tacagtgata ggtagttc 
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GENBANK . RTM . COPYRIGHT 2004 on STN 



LOCUS (LOC): AK045251 
GenBank ACC. NO. (GBN) : AK045251 
GenBank VERSION (VER) : AK045251.1 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF): 



GenBank (R) 
Gl: 26090799 



492752-02-0 
3376 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 9.5 days embryo parthenogenote cDNA, RIKEN 
full-length enriched library, clone:Bl30051ol8 product: 
***unc5*** homolog (c. elegans) 3, full insert 



KEYWORDS (ST) : 
SOURCE : 
ORGANISM (ORGN) 



sequence. 
HTC ; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Tissues were provided by Dr. Tomohiro Kono (Department of Animal 
Science, Tokyo University of Agriculture, 1737 Hunako Atsugi city, 
Kanagawa Prefecture, Japan) whose assistance we gratefully 
acknowledge. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
URL : http : //f antom . gsc . ri ken . j p/ . 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO) : 
OTHER SOURCE (OS) I 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 
OTHER SOURCE (OS) ; 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



carninci,P.; Hayashizaki , Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; shibata,Y.; Hayatsu,N.; Sugahara,Y.; 
Shibata,K.; ltoh,M.; Konno,H.; okazaki,Y.; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; ltoh,M.; Aizawa,K.; Nagaoka,s.; Sasaki, N.; 
Carninci,P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai.T.; Tashiro,H.; Itoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A.; 
Yamamoto,R.; Matsumoto,H. ; Sakaguchi ,s. ; lkegami,T.; 
Kashiwagi ,K. ; Fujiwake,s. ; Inoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
Ishikawa,T.; Ozawa,K.; Tanaka,T.; Matsuura,s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; inoue.Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system--384-format sequencing pipeline with 384 

multicapillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cdna 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cdnas 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3376) 

Adachi,J.; Aizawa,K.; Akimura,T. ; Arakawa,T.; Bono,H.; 
Carninci,P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume,W. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
Imotani,K.; lshii,Y.; ltoh,M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima,Y.; Kondo,s.; Konno.H.; 
Kouda,M.; Koya,s.; Kunhara,c; Matsuyama,T. ; 
Miyazaki.A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K. ; Numazaki,R.; ohno,M.; ohsato,N.; okazaki,Y.; 
saito,R.; Saitoh, H.; sakai,C.; Sakai,K.; sakazume,N.; 
sano,H.; Sasaki, D. ; shibata,K.; shinagawa,A. ; 
Shiraki,T.; Sogabe,Y.; Tagami,M.; Tagawa,A. ; 



TITLE (TI): 
JOURNAL (SO) : 



FEATURES (FEAT) 
Feature Key 



Takahashi , F. ; Takaku-Akahira,s. ; Takeda,Y.; Tanaka,T. ; 
Tomaru,A.; Toya,T. ; Yasunishi ,A. ; Muramatsu ,M. ; 
Hayashizaki ,Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
Institute of Physical and chemical Research (RIKEN) , 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- 
resOgsc . ri ken . j p , URL : http : //genome . gsc . ri ken . i p/ , 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



Location 



Qualifier 



source 1..3376 



misc-feature 1. .3376 



/orqanism="Mus musculus" 
/moT-type="mRNA" 
/strain= M C57BL/6J" 
/db-xref="FANTOM-DB:Bl30051Ol8" 
/db-xref="taxon : 10090" 
/clone="Bl30051ol8" 
/tissue-type="parthenogenote" 
/clone-lib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="9.5 days embryo" 
/note="unc5 homolog (c. elegans) 3 
(MGD | MGI: 1095412, GB | NM-009472 , 
evidence: BLASTN, 100%, 
match=239)" 



SEQUENCE (SEQ): 

1 gggttatttc tcaggactgc ctggcggtgg ccggatccag cctcctgcct ggctgggctt 
61 tcggctgttt gcgcgtctcc tggtggcgtt tcccttcccc gtacacctct gccgacgatg 
121 aggaaaggtc tgagggcgac agcggcccgc tgcggactgg gactaggata cttgctgcag 
181 atgcttgtgt tacctgccct ggccctgcta agcgccagtg gcaccggctc cgccgctcaa 
241 ggtaagaggg tcttcagggc gcggcagtca cctccctttg cttctgtctt ccctctcttc 
301 tacctctatt tgtccccggc gctccttgat tcctcacctg cctagttgcc cccaatggaa 
361 accactttaa atcccacttt cccaactagt taagaaacca gattttcagt cttagctctg 
421 gatctgggta aggatggacc cggggtgtgt taagtgtagc tgctgtctcg gtgcctgtgg 
481 gatcctggag tgtcttttgc actttcagac ccacagacgc acggttttga atattgtgta 
541 cccgggggtt tccagaggca ctgatagtgg cgttgtggta tgtgccccaa tatgtgtagt 
601 gtgccgaggc ttctcgctgt gcgcgatggc tgcgcttgca gggttttctt ccacctccgc 
661 taaggaaata tgtccattac cgccaaggac ccataggcgc tggtgcccct agactgctga 
721 tggggcgcta tggagatgct caattcgccc agaaaggaca ggctccgggt gtgcctgctt 
781 ctccactcac aacctcttcg tgtcccgctc cggacaggcg atccttgaaa aacttcaaaa 
841 ccacagctct agtgggtgac tctagctgtt cgaaccaaat ctgcaactct ctgggggaaa 
901 ggacgctgct aaccagttta taactgacat tttaatgcgt gtaagcctcg aagtgtgtgg 
961 cactctgagc aaagccggag taggaggaca gtggtccaga gttgggaatg tcttgtacgg 
1021 gtagagggaa gacacctaga ctgaggttca tcttggaggc ttctttcttg ttcattgagt 
1081 tttagaggga ggtgcagctc ccagacttat tcaaaccagt tcccacccag atccttcttt 
1141 gccattttaa agatttattt atttatttat ttatttattt atttatttat ttattgtcta 
1201 tagccagaag gcaacacagc tttgcctgga gagagaggca acaattcaga accagctcag 
1261 agaccttccc ttttctacac acacacacac acacacacac acacacacac acttctttgt 
1321 gaatttggag agtttggggg gaggagcagc aaccctcctt ccatgatgct cacagactgg 
1381 tttattccaa ctgcagcctg catttcctaa aatctacagg cactgatgca gggaggtggg 
1441 tgctgaaaga aagcccagag gccaggcaag cctggtcctc atgaggaggt atgcgatgag 
1501 caaggcagtg tgaatctgag atagctactt agggagtaaa ttgctcctgc tacactgcta 
1561 agtggccatg ctcataaatc ctgccttcct ttctggacag acgcttccta cccttgtggg 
1621 tcagctctgg tttgcttatg cacacatcca gagcacctct gaccatgttc aagggaatcc 
1681 gtgctagaaa gcctagagtt cttgaaagtg gtgagaagtt ttcaggactc tagccagaaa 
1741 tgcttccagg aggatcttag gaagccaacg tggacaacga tcttaggaag cctaagggtt 
1801 ctggaatcag ctgtatgagt cctgctggaa ttcggtgtca tttttctgtg cacaaaggac 
1861 taagccactg gggggtgagg ggaagggttt gggaaaagga aaagttaaaa gtccccaaag 
1921 atgcaaaaag gaggcaggat ccctgtggag agcttacaac agccaaatta aaaagactgg 
1981 gtaggagaag gatcgatgaa gataatcaaa caggggactg taagtggatt aatattgaaa 
2041 gaagggagca gagactaagc gggaaggagc tggggatacg ttctgcctcc tgcactttct 
2101 cttggtctac accagacaag gcaaaggttg aggattaatt ctgatcttca gcaggggaag 
2161 aaaagaaggt ttagggggtg tacgccattc tggtgtggat tttctccatt tgctctcgat 
2221 cagggagaat aactcccccc cccttcaaat cacatggtct gagatgcagg actctcactt 
2281 ttatctttgt tatcctccat gttccagagc accagctctt cctgtgttat ctgtgagaca 
2341 tcaaagggag tgtagagcac agaaatctgc tatcagctac tgtccctgtg gctgctactc 
2401 attagccagg cctgtttact actgacctgc tgttaacact ctattctcag ccagtcactg 
2461 taggacagtc tggtgggcag gctcccattc ccttcagtat ctcctttctg tccaatttac 
2521 actcgagctt ccacacatac cccaaaggta ccatttgctt gttctgctgc attaatgact 
2581 ccagtggaca gtacttttca tgaatggaag tacatatgca actagaatgc aagacattca 



2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 



tgaagtactt 
gaagctttcg 
accaaccaca 
tgcaactgtg 
gacctgtagg 
gctgggaaac 
tttactagga 
gaatttttgc 
gtgctaacac 
ttattgcata 
agtactttca 
agccaaattt 
cactatggag 



tacaatgtga 
aaaaaaaaaa 
acatggttgt 
gattaaagac 
caacaacttt 
cttttctcca 
aaggcaaggg 
ctccttttac 
ctccaagttt 
aatttagtgt 
tagctttatg 
gatggctact 
atttac 



tgaaggtttc 
agtagatctc 
agaagcacag 
agacagcagt 
ccccctctct 
gggaattctc 
gtgtgtgtgt 
gtcacaatgt 
agaccagaaa 
ttttaagaca 
catgttggtg 
ctctcatata 



gaacctgttg 
gagttgaaat 
agcgagtatt 
catgttaaac 
acccctgctc 
agggctcctg 
gtgtgtgtgt 
ctctcttatc 
attattaata 
agtaacacat 
tttcacaaat 
tggctcagct 



gagcagtgct 
cagcccaggg 
atttatgagt 
aggtccttgg 
taattggttc 
acttccaaga 
gtgtgtgtgt 
tgccaccact 
tctattcaga 
ttctttccac 
tgacagtctg 
gggagacaaa 



gagaaaatga 
gaggagaaaa 
gagggagttt 
gaaattctct 
attgagtatt 
atgaaagatt 
gtgtgtgagt 
aatattatgt 
gataattccc 
aggagaggaa 
ttttgataat 
ttcatgtatt 
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GEN BANK . RTM . COPYRIGHT 2004 on STN 



LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
GenBank version (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 



GenBank (R) 
GI: 26084863 



AK035842 
AK035842 
AK035842.1 
492705-68-7 
3620 

mRNA; linear 

High-Throughput CDNA Sequencing 
3 Apr 2004 

Mus musculus 16 days neonate cerebellum cDNA, RIKEN 
full-length enriched library, clone :9630009N10 product: 

***unc5*** homolog (c. elegans) 3, full insert 
sequence. 
HTC; CAP trapper 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . jp/ 
URL : http : //f antom . gsc . ri ken . jp/ . 
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AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO): 
OTHER SOURCE (OS) 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI); 
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Carninci.P.; Hayashizaki , Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci,P.; shibata,Y.; Hayatsu,N.; Sugahara,Y. ; 
shibata,K.; Itoh,M.; Konno,H.; okazaki,Y.; 
Muramatsu.M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res. f 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

Shibata,K.; Itoh,M.; Aizawa,K.; Nagaoka,S.; Sasaki, N.; 
Carmnci.P.; Konno,H.; Akiyama,J.; Nishi,K.; 
Kitsunai ,T. ; Tashiro.H.; ltoh,M.; sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto,R.; Matsumoto,H . ; Sakaguchi ,S. ; Ikegami.T.; 
Kashiwagi ,K. ; Fujiwake,s.; lnoue,K.; Togawa,Y.; 
lzawa,M.; ohara,E.; Watahiki,M.; Yoneda,Y.; 
Ishikawa,T.; Ozawa,K.; Tanaka,T. ; Matsuura.s.; 
Kawai,J.; okazaki,Y.; Muramatsu,M. ; Inoue,Y.; Kira,A.; 
Hayashizaki ,Y. 

RIKEN integrated sequence analysis (RISA) 

system--384-format sequencing pipeline with 384 

multicapillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the fantom Consortium. 

Functional annotation of a full-length mouse cDNA 
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OTHER SOURCE (OS) : 
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AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
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AUTHOR (AU) : 
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Nature, 409, 685-690 (2001) 
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5 

The FANTOM consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 
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Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa,T.; Bono,H.; 
Carninci,P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume ,w. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane.T.; Hori,F.; 
lmotani,K.; lshii,Y.; Itoh.M.; Kagawa,l.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima,Y.; Kondo,s.; Konno,H.; 
Kouda,M.; Koya,S.; Kurihara,c. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura,M.; Nishi,K.; 
Nomura, K.; Numazaki,R.; ohno,M.; ohsato,N.; okazaki,Y.; 
Saito,R.; Saitoh,H.; Sakai,C; Sakai,K.; Sakazume ,N. ; 
Sano,H.; Sasaki, D.; Shibata,K.; Shinagawa,A. ; 
Shi raki , T . ; Sogabe , Y . ; Tagami ,M. ; Tagawa , A . ; 
Takahashi ,F. ; Takaku-Akahi ra,s. ; Takeda,Y.; Tanaka,T.; 
Tomaru,A.; Toya,T.; Yasunishi ,A. ; Muramatsu,M. ; 
Hayashizaki , Y. 
Direct Submission 

submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- 
res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . j p/ , 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT): 

Feature Key Location 

source 1..3620 



misc-feature 



1..3620 



Qualifier 

/orqanism="Mus musculus" 
/moT-type="mRNA" 
/strain="C57BL/6J" 
/db-xref="FANTOM-DB:9630009NlO" 
/db-xref="taxon : 10090" 
/clone= M 9630009NlO" 
/tissue-type= n cerebellum M 
/clone-lib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="16 days neonate" 
/note="unc5 homolog (C. elegans) 3 
(MGD | MGI : 1095412 , GB | NM-009472 , 
evidence: BLASTN, 99%, match=464)" 



SEQUENCE (SEQ) : 

1 tttttttttt 
61 tgaaggatag 
121 ttagtaaagc 
181 gttaatacca 
241 ttccagccag 
301 aggtagaagc 
361 gaattcccca 
421 cctctgtgcc 
481 aagctggagt 
541 tgagggcaag 
601 acatacatgt 
661 aattaaaaaa 
721 gggaggtaga 
781 tccaggccat 
841 gaagaccctt 
901 gaagacatga 
961 tgtacaaatt 
1021 cctttggtgc 
1081 atcattgtaa 
1141 tgtaaggaaa 
1201 tttaggcagt 
1261 aaatagtgtg 



ttttaattca 
actctgtgct 
gcttgctgtg 
ggtatgacaa 
ccagtccggc 
cagagagatg 
gaacccaagt 
gaggtccgtg 
acacacacag 
aactgactcc 
actcacatgt 
cataggtaga 
ggcaggtaga 
ccagtagtgc 
gaggttgact 
gagttaaggg 
ttgttggctt 
caaaccagac 
cctattaaaa 
gcagaaacta 
ttcaaagaaa 
actcattatt 



aatttttatg 
ctattgtaag 
taatcaggag 
catacaattg 
caaattgtta 
gctcatgggt 
taaaagctgg 
gcagagatgg 
caggaagtca 
tgagagattt 
cacatacaca 
agactgagtg 
cctctgtgag 
ttagggagac 
tctgttcaca 
aaaagttcca 
ttatttctaa 
acacatacaa 
agcattttag 
ccatttatga 
aaaaatacaa 
ttttctaggt 



tattatacaa 
tggatgaaag 
ggcctgagtt 
tgctgaggaa 
aactctggta 
aaaagcactt 
atgtggctgc 
gaggattgcc 
cagcaggaga 
cctctgacct 
cacacacaca 
tggtggtgga 
ttcaaagcca 
cctatctcaa 
cacacacaca 
gtacatgagg 
atgaaatttt 
aggtatctac 
agagcatata 
catatgacat 
attttgagaa 
atagattgtc 



atgtttgcca 
agcagggctg 
cagatctcta 
gcagagacag 
agagcctatc 
gcctcataag 
ggtcgtctga 
cagaagctca 
cacctcacct 
ccacacttag 
cacacacaca 
tatctttaat 
gcctggttta 
aaaataaaag 
cacacacaca 
ccaaagagag 
gccagagagt 
agacactaga 
tcagtaagat 
aattttatat 
tatagtaaat 
aattcaaatt 



aagtgatgtc 
gggaggtcac 
ataccatggt 
gaggtttgcc 
tcaaaaagta 
cctgatgaca 
tatcccagcg 
ggtgccagcc 
caaaacaagg 
tctgtagcat 
cacacacata 
ccgagcactt 
catagtgagt 
gtgagtgatt 
aattcaggat 
aatcaaatgt 
tagcaattat 
gactgagaga 
gtgtaaaagt 
acatggaaaa 
taggtgctgg 
agcaaggtat 



1321 gtcatttaga atatcaatgt aaaagcaatt atatagttat gaggttaaga catttgaagg 
1381 atttttttcc tttgtgatac tggggataga tcatagacgt catgcatgcc gggcatggga 
1441 ttcaccacta agcacatacc cagctttaga aggaacattt tcaaattcta cctagttata 
1501 aaaataattg tgtaattgtg catgaacaca acacacacac aaatatacac acatgtatat 
1561 attgttcttg aatatatgtg ataaatattt ttgatataca tgccatctat aagtgattct 
1621 gaaagattac cacactttaa tgtagagtaa cttctaataa cagctcagtg caacttgaaa 
1681 tagaagtgca atcaaataat ctttaaaaac ataaaataaa agagatgtga tcttctctga 
1741 agatcacaca cagatctaca attaagattt ggttagataa aacatgaaaa tagtttaaaa 
1801 cttccagaat gtgatgattg tatgaccagt taaaaaggag tcgattagta catgcattga 
1861 gatatcaagg caggaagtca gaaaggaaag aaatagatac actaaatatg tatcctattt 
1921 cattatttaa agtaagttct agaagttatt gagtgacgtc accatcatca tgaccatcat 
1981 catcaccacc accaccatca tcatcaccat caccatcact attatcattc tggccatttg 
2041 ttggtcggtt actcggtttc agtaatggga cagccaaacc ttttaaatta aaacacaaaa 
2101 taaagaatct ttataggaaa ccattaggta attgaatttt aaaataccac aaacagagcc 
2161 aaggaggata tttttaaaaa cttatatcaa cattgattaa catgagtcaa agagataact 
2221 taattactat gcaaatattt ttaaattaca ttttagaaaa tgcactagca cccataagaa 
2281 aaccaaccaa cattacaaca tgtaattcta ggcaaaggaa tgctaattat aaataaacat 
2341 ctaagaagat gttagctact tcaatacaag aattaaagac ctttttttta ctggggcaca 
2401 aattgcaatg ccagtatgaa attagcacca gtactgaaat acattatacc cacccccatt 
2461 gcttctgcca ggcgagtgag tctagggact tcctacaagt aaaattctgg atataaacaa 
2521 agatttagca ttgcaagttg tctactgtgc cctgttgatt atgttctttc ctagaactga 
2581 ttactatagt gttgttgttg ttgttgctgt tgttgtttaa gtgaagtttt tccactaatg 
2641 agactgtgta aattctgtta gataaaactg aacacaaaat ccaaattaga gaacccagca 
2701 tccctgctgt attttgctgc tgtgcaaaga agggagatac acatgttttt gtatagcagt 
2761 tgaaactttc tgcataaaca cacaccaaga tgttgacagc taagaatcta gggtgatgac 
2821 ttgagacttt tgttgtttga tatttttcag tcttttgaat atattatttt catcataaat 
2881 aatgaaaccc actgaacagc tatgccactt tattgtagct ctctatcagc tactataaat 
2941 ccccaatcta gacaaactgt gtagaaattt ctcggtctct tatattagag ttccttgcca 
3001 tcttctgaga ccaaacactg caaagatgga atgggaatag aattctttct ggggtctttt 
3061 gtttgtttat ttgtttttat aaaactagat tcacctgcaa cttgggtcct cagctctata 
3121 ctttaacatt ctgattttat tattttgagg ggaccatggt ggttttttta aaagcaagaa 
3181 atattagaat atattaagtc ttgaaataca ttgcaaaact gaaatatttt tagtgtctta 
3241 aaaaaatcac atttttcctt aaatgacaga acagacaaaa gtcattgcaa acacattcat 
3301 tttcgagtta tatcttcctc agaacaacac atgcaaatca tatttagtag caatcacatt 
3361 tcctagcaac taagcatagc taattgcatt taacaaagtc tctaccaaga taatttcaca 
3421 tttgaaaaat taataaacac tttgaagaag agagagatct taaaaattct tttacttact 
3481 cctaacccag tggcttacat ctgctgttcc atcactcaga aggctgaggc aagaggattg 
3541 ttatcagttc aaaagcaagc ctacctagtg agttcaaggt caccctaagc tataacttga 
3601 gaccctttcc caaccaaatt 
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LOCUS (LOC): AK035038 GenBank (R) 

GenBank ACC. NO. (GBN) : AK035038 



GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 



AK035038.1 Gl: 26084357 
492700-62-6 
3050 

MOLECULE TYPE (CI) : mRNA; linear 
DIVISION CODE (CI): Hi gh-Throughput CDNA Sequencing 
DATE (DATE): 3 Apr 2004 

DEFINITION (DEF) : Mus musculus 12 days embryo embryonic body between 

diaphragm region and neck cDNA, riken full-length 
enriched library, clone:9430077M22 product: ***unc5 
homolog (C. elegans) 3, full insert sequence. 

KEYWORDS (ST): HTC; CAP trapper 

SOURCE: Mus musculus (house mouse) 

ORGANISM (orgn): Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
sciurognathi ; Muridae; Murinae; Mus 

COMMENT: 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 

URL : http : //genome . gsc . ri ken . j p/ 

URL : http : //f antom . gsc . ri ken . jp/ . 
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Adachi,J.; Aizawa,K.; Akimura,T.; Arakawa,T. ; Bono,H.; 
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shiraki.T.; Sogabe,Y.; Tagami,M. 
Takahashi , F . ; Takaku-Akahi ra , s . ; 
Tomaru,A.; Toya,T.; Yasunishi,A. 
Hayashizaki ,Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic sciences Center (GSC) , RIKEN Yokohama 
Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail :genome- 
res@gsc . ri ken . jp , URL : http : //genome . gsc . ri ken . ip/ , 
Tel: 81-45-503-9222, Fax:81-45-503-9216) 
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FEATURES (FEAT): 

Feature Key Location 

source 1..3050 



Qualifier 

/organism="Mus musculus" 
/moT-type="mRNA" 
/strain= M C57BL/6J M 
/db-xref="FANTOM-DB:9430077M22 M 
/db-xref="taxon : 10090" 
/clone= M 9430077M22" 
/tissue-type= n embryonic body 
between diaphragm region and neck" 
/clone-! ib="RIKEN full-length 
enriched mouse cDNA library" 
/dev-stage="12 days embryo" 



misc-feature 1..3050 /note="unc5 homolog (c. elegans) 3 

(MGD | MGI: 1095412, GB | NM-009472 , 
evidence: BLASTN, 99%, match=464) n 

SEQUENCE (SEQ): 

1 ggttatttct caggactgcc tggcggtggc cggatccagc ctcctgcctg gctgggcttt 
61 cggctgtttg cgcgtctcct ggtggcgttt cccttccccg tacacctctg ccgacgatga 
121 ggaaaggtct gagggcgaca gcggcccgct gcggactggg actaggatac ttgctgcaga 
181 tgcttgtgtt acctgccctg gccctgctaa gcgccagtgg caccggctcc gccgctcaag 
241 atgatgaatt ttttcacgaa ctcccagaaa cctttccatc tgacccacct gagccattgc 
301 cacacttcct cattgagccc gaggaagctt acattgtgaa gaacaagcct gtgaacctgt 
361 attgtaaagc cagccctgcc acccagatct acttcaagtg caacagcgag tgggttcatc 
421 agaaggacca cgtagtagac gagagagtag atgaaacctc tggtgagttt ggggttgtgg 
481 ctcaggactc actccacggt gctgttgtca caatactact acaggtagca aggctgaaaa 
541 tggagttctt atcttttggc aaaagccttt gaatcagtta ggttaagaat taatgatctc 
601 attttatgca atgcaaagca atccatcaat gataaatgct ttattgttga tgaactatgt 
661 gactgttcaa taagatacat aatatggaat cgatagaatg aataaatggg aacatcctgg 
721 aaaatgatgt ctttgaattt cacttgtaca gaaatcaaag ataaatctct tgcaatccag 
781 gggtcatgtg caaatgagaa tacattgagc ttgatgacat cattgcccaa cattggtctg 
841 catacatacc tgatattatt ttacagaata ttttaaataa ttccttgaaa attagctacc 
901 agtgaaactt tgtgatgttt tagttcacaa ttaccaatga aagatagtta tgcaatacct 
961 atatttctct taaattagaa gagagaaatg tttcttctgt ggcttaagtg ttttcatctc 
1021 cagcagggag actagcactt aaaacattca gatctaaata tgttatggtt ttaaactata 
1081 tagtaatgtt aagacattat taatcaacat ttctaagcta acaggaatta aaatgacaga 
1141 aatgagaaac atttttacat cttatctttc tacatacaga gagcagattc agggtttcaa 
1201 atgtgcaaat aattccagtg tgtgcctatc tattggctta atctgaaaca aggaaagaac 
1261 agcacattcc acatccccag caacaatatc aattgaatag atggtaatca aaggcataga 
1321 ctaggctatt gattaagaag atgcatctca aaaggacttg taaaacatgt gtgtacatag 
1381 tgtaagtggt gacaggaaat caagtggcag ctgtaggtaa tggctgttca tctgctagtc 
1441 agtgtgttca ctgactgcaa gctgctgact gcagaaaaca gcgattatcc aatcacaggc 
1501 aagcttacag cagggaaatg cctcacctct tcatctaatt actcagaaaa taactctacc 
1561 agacgttgac aacaatctca aaagaactga atattttgat ttcagctcat ttatttctca 
1621 agatttcata tacatatact tttaataatg catgcaaggt atcagaaatg taaaaaggag 
1681 aatcaatgta tttagacatg tcttaataat taatagaaaa gtttgtgtga gaagattata 
1741 aataaaaatt aaatgtctga tattcagcac ttctgagaac ttctccatat gttagtaaca 
1801 tacacacaca cacacacaca cacacacaca cacacacaca cacacaaatt caggatgaag 
1861 acatgagagt taagggaaaa gttccagtac atgaggccaa agagagaatc aaatgttgta 
1921 caaattttgt tggcttttat ttctaaatga aattttgcca gagagttagc aattatcctt 
1981 tggtgccaaa ccagacacac atacaaaggt atctacagac actagaactg agagaatcat 
2041 tgtaacctat taaaaagcat tttagagagc atatatcagt aagatgtgta aaagttctaa 
2101 ggaaagcaga aaactaccat ttatgacata tgacataatt ttatatacat ggaaaattta 
2161 tgcagtttca aagaaaaaaa atacaaattt tgagaatata gtaaattagg tgctgaaaat 
2221 agtgtgactc attatttttt ctaggtatag attgtcaatt caaattagca aggtatgtca 
2281 tttagaatat caatgtaaaa gcaattatat agttatgagg ttaagacatt tgaaggattt 
2341 ttttcctttg tgatactggg gatagatcat agacgtcatg catgccgggc atgggattca 
2401 ccactaagca catacccagc tttagaagga acattttcaa attctaccta gttataaaaa 
2461 taattgtgta attgtgcatg aacacaacac acacacaaat atacacacat gtatatattg 
2521 ttcttgaata tatgtgataa atatttttga tatacatgcc atctataagt ggttctgaaa 
2581 gattaccaca ctttaatgta gagtaacttc taataacagc tcagtgcaac ttgaaataga 
2641 agtgcaatca aataatcttt aaaaacataa aataaaagag atgtgatctt ctctgaagat 
2701 cacacacaga tctacaatta agatttggtt agataaaaca tgaaaatagt ttaaaacttc 
2761 cagaatgtga tgattgtatg accagttaaa aaggagtcga ttagtacatg cattgagata 
2821 tcaaggcagg aagtcagaaa ggaaagaaat agatacacta aatatgtatc ctatttcatt 
2881 atttaaagta agttctagaa gttattgagt gacgtcacca tcatcatgac catcatcatc 
2941 accaccacca ccatcatcat caccatcacc atcactatta tcattctggc catttgttgg 
3001 tcggttactc ggtttcagta atgggacagc caaacctttt aaattaaaac 
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LOCUS (LOC): AK034558 GenBank (R) 

GenBank ACC. NO. (GBN) : AK034558 

GenBank VERSION (VER) : AK034558.1 GI: 26084048 

CAS REGISTRY NO. Crn): 492697-53-7 

SEQUENCE LENGTH (SQL): 3052 

molecule type (ci) : mRNA; linear 

DIVISION CODE (CI): High-Throughput CDNA Sequencing 
DATE (DATE): 3 Apr 2004 

DEFINITION (DEF): Mus musculus 12 days embryo embryonic body between 

diaphragm region and neck cDNA, RIKEN full-length 
enriched library, clone :9430006E08 product: *** un c5*** 
homolog (C. elegans) 3, full insert sequence. 

keywords (ST): HTC; CAP trapper 

source: Mus musculus (house mouse) 

ORGANISM (ORGN): Mus musculus 

Eukaryota; Metazoa; chordata; craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 



Sciurognathi ; Muridae; Murinae; Mus 

COMMENT : 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL : http : //genome . gsc . ri ken . j p/ 
URL : http : / It antom . gsc . ri ken . j p/ . 



(OS); 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO): 
OTHER SOURCE 

REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 



JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI) 



JOURNAL (SO): 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO): 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 
JOURNAL (SO): 



Carninci , P. ; Hayashizaki ,Y. 
High-efficiency full-length cDNA cloning 
Meth. Enzymol., 303, 19-44 (1999) 
CA 131:318304 
2 

Carninci ,P.; shibata,Y. ; Hayatsu,N.; Sugahara,Y. ; 
Shi bata ,K. ; Itoh , M . ; Konno , H . ; Okazaki ,Y. ; 
Muramatsu,M. ; Hayashizaki ,Y. 

Normalization and subtraction of cap-trapper-selected 

cDNAs to prepare full-length cDNA libraries for rapid 

discovery of new genes 

Genome Res., 10 (10), 1617-1630 (2000) 

CA 134:305920 

3 

shibata,K.; itoh,M.; Aizawa,K.; Nagaoka.s.; Sasaki, N.; 
Carninci, p.; Konno, H.; Akiyama,J.; Nishi,K.; 
Kitsunai,T.; Tashiro,H.; Itoh,M.; Sumi,N.; lshii,Y.; 
Nakamura,s.; Hazama,M.; Nishine,T.; Harada,A. ; 
Yamamoto,R.; Matsumoto,H. ; Sakaguchi ,s. ; lkegami,T.; 
Kashiwagi ,K. ; Fujiwake,s.; inoue.K.; Togawa,Y.; 
lzawa,M.; Ohara,E.; Watahiki,M.; Yoneda,Y. ; 
lshikawa,T.; Ozawa,K.; Tanaka,T. ; Matsuura,S.; 
Kawai,J.; Okazaki ,Y. ; Muramatsu,M. ; lnoue,Y.; Kira,A.; 
Hayashizaki , Y. 

RIKEN integrated sequence analysis (RISA) 

system — 384-format sequencing pipeline with 384 

multi capillary sequencer 

Genome Res., 10 (11), 1757-1771 (2000) 

4 

The RIKEN Genome Exploration Research Group Phase II 
Team; the FANTOM Consortium. 

Functional annotation of a full-length mouse cdna 
collection 

Nature, 409, 685-690 (2001) 

CA 134:203311 

5 

The FANTOM Consortium; the RIKEN Genome Exploration 
Research Group Phase I & II Team. 

Analysis of the mouse transcriptome based on functional 

annotation of 60,770 full-length cDNAs 

Nature, 420, 563-573 (2002) 

CA 138:131939 

6 (bases 1 to 3052) 

Adachi,J.; Aizawa,K.; Akimura,T. ; Arakawa,T.; Bono,H.; 
carninci, P.; Fukuda,s.; Furuno,M.; Hanagaki,T.; 
Hara,A.; Hashizume,w. ; Hayashida,K. ; Hayatsu,N.; 
Hiramoto,K.; Hiraoka,T.; Hirozane,T.; Hori,F.; 
Imotani.K.; lshii,Y.; ltoh,M.; Kagawa,I.; Kasukawa,T.; 
Katoh,H.; Kawai,J.; Kojima,Y.; Kondo,S.; Konno, H.; 
Kouda,M.; Koya,S.; Kurihara,C. ; Matsuyama,T. ; 
Miyazaki,A.; Murata,M.; Nakamura.M.; Nishi.K.; 
Nomura , K . ; Numazaki , R . ; ohno , M . ; ohsato , N . ; okazaki , Y . ; 
Saito,R.; Saitoh, H.; Sakai,C; Sakai,K.; Sakazume,N.; 
Sano,H.; Sasaki, D.; Shibata,K.; shinagawa,A. : 
shiraki.T.; Sogabe,Y.; Tagami,M. 
Takahashi , F. ; Takaku-Akahi ra,S. ; 
Tomaru,A.; Toya,T. ; Yasunishi,A. 
Hayashizaki ,Y. 
Direct Submission 

Submitted (16-JUL-2001) Yoshihide Hayashizaki, The 
institute of Physical and Chemical Research (RIKEN), 
Laboratory for Genome Exploration Research Group, RIKEN 
Genomic Sciences Center (GSC) , RIKEN Yokohama 
institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail : genome- 



Tagawa,A. 
Takeda,Y. ; Tanaka,T. 
; Muramatsu,M. ; 



res@gsc . ri ken . j p , URL : http : //genome . gsc . ri ken . i p/ , 
Tel :81-45-503-9222, Fax:81-45-503-9216) 



FEATURES (FEAT): 

Feature Key Location 

source 1..3052 



misc-feature 1. .3052 



Qualifier 

/orqanism= ,, Mus musculus" 
/moT-type="mRNA" 
/strain= M c57BL/6D" 
/db-xref="FANTOM-DB:9430006E08" 
/db-xref ="taxon : 10090" 
/clone="9430006E08" 
/tissue-type= M embryonic body 
between diaphragm region and neck" 
/clone-! ib="RiKEN full-length 
enriched mouse cDNA library" 
/dev-stage="12 days embryo" 
/note="unc5 homolog (C. elegans) 3 
(MGD I MGI : 1095412 , GB | NM-009472 , 
evidence: BLASTN, 99%, match=464) M 



SEQUENCE (SEQ): 

1 ggttatttct caggactgcc tggcggtggc cggatccagc ctcctggcct ggctgggctt 
61 tcggctgttt gcgcgtctcc tggtggcgtt tcccttcccc gtacacctct gccgacgatg 
121 aggaaaggtc tgagggcgac agcggcccgc tgcggactgg gactaggata cttgctgcag 
181 atgcttgtgt tacctgccct ggccctgcta agcgccagtg gcaccggctc cgccgctcaa 
241 gatgatgaat tttttcacga actcccagaa acctttccat ctgacccacc tgagccattg 
301 ccacacttcc tcattgagcc cgaggaagct tacattgtga agaacaagcc tgtgaacctg 
361 tattgtaaag ccagccctgc cacccagatc tacttcaagt gcaacagcga gtgggttcat 
421 cagaaggacc acgtagtaga cgagagagta gatgaaacct ctggtgagtt tggggttgtg 
481 gctcaggact cactccacgg tgctgttgtc acaatactac tacaggtagc aaggctgaaa 
541 atggagttct tatcttttgg caaaagcctt tgaatcagtt aggttaagaa ttaatgatct 
601 cattttatgc aatgcaaagc aatccatcaa tgataaatgc tttattgttg atgaactatg 
661 tgactgttca ataagataca taatatggaa tcgatagaat gaataaatgg gaacatcctg 
721 gaaaatgatg tctttgaatt tcacttgtac agaaatcaaa gataaatctc ttgcaatcca 
781 ggggtcatgt gcaaatgaga atacattgag cttgatgaca tcattgccca acattggtct 
841 gcatacatac ctgatattat tttacagaat attttaaata attccttgaa aattagctac 
901 cagtgaaact ttgtgatgtt ttagttcaca attaccaatg aaagatagtt atgcaatacc 
961 tatatttctc ttaaattaga agagagaaat gtttcttctg tggcttaagt gttttcatct 
1021 ccagcaggga gactagcact taaaacattc agatctaaat atgttatggt tttaaactat 
1081 atagtaatgt taagacatta ttaatcaaca tttctaagct aacaggaatt aaaatgacag 
1141 aaatgagaaa catttttaca tcttatcttt ctacatacag agagcagatt cagggtttca 
1201 aatgtgcaaa taattccagt gtgtgcctat ctattggctt aatctgaaac aaggaaagaa 
1261 cagcacattc cacatcccca gcaacaatat caattgaata gatggtaatc aaaggcatag 
1321 actaggctat tgattaagaa gatgcatctc aaaaggactt gtaaaacatg tgtgtacata 
1381 gtgtaagtgg tgacaggaaa tcaagtggca gctgtaggta atggctgttc atctgctagt 
1441 cagtgtgttc actgactgca agctgctgac tgcagaaaac agcgattatc caatcacagg 
1501 caagcttaca gcagggaaat gcctcacctc ttcatctaat tactcagaaa ataactctac 
1561 cagacgttga caacaatctc aaaagaactg aatattttga tttcagctca tttatttctg 
1621 aagatttcat atacatatac ttttaataat gcatgcaagg tatcagaaat gtaaaaagga 
1681 gaatcaatgt atttagacat gtcttgataa gttaataaaa aagtttgtgt gagaagatta 
1741 taaatagaaa ttaaatgtct gatattcagc acttgtgaga acttctccat atgttagtaa 
1801 catacacaca cacacacaca cacacacaca cacacacaca cacacacaaa ttcaggatga 
1861 agacatgaga gttaagggaa aagttccagt acatgaggcc aaagagagaa tcaaatgttg 
1921 tacaaatttt gttggctttt atttgtaaat gaaattttgc cagagagtta gcaattatcc 
1981 tttggtgcca aaccagacac acatacaaag gtatctacag acactagaac tgagagaatc 
2041 attgtaacct attaaaaagc attttagaga gcatatatca gtaagatgtg taaaagttct 
2101 aaggaaagca gaaaactacc atttatgaca tatgacataa ttttatatac atggaaaatt 
2161 tatgcagttt caaagaaaaa aaatacaaat tttgagaata tagtaaatta ggtgctgaaa 
2221 atagtgtgac tcattatttt ttctaggtat agattgtcaa ttcaaattag caaggtatgt 
2281 catttagaat atcaatgtaa aagcaattat atagttatga ggttaagaca tttgaaggat 
2341 ttttttcctt tgtgatactg gggatagatc atagacgtca tgcatgccgg gcatgggatt 
2401 caccactaag cacataccca gctttagaag gaacattttc aaattctacc tagttataaa 
2461 aataattgtg taattgtgca tgaacacaac acacacacaa atatacacac atgtatatat 
2521 tgttcttgaa tatatgtgat aaatattttt gatatacatg ccatctataa gtggttctga 
2581 aagattacca cactttaatg tagagtaact tctaataaca gctcagtgca acttgaaata 
2641 gaagtgcaat caaataatct ttaaaaacat aaaataaaag agatgtgatc ttctctgaag 
2701 atcacacaca gatctacaat taagatttgg ttagataaaa catgaaaata gtttaaaact 
2761 tccagaatgt gatgattgta tgaccagtta aaaaggagtc gattagtaca tgcattgaga 
2821 tatcaaggca ggaagtcaga aaggaaagaa atagatacac taaatatgta tcctatttca 
2881 ttatttaaag taagttctag aagttattga gtgacgtcac catcatcatg accatcatca 
2941 tcaccaccac caccatcatc atcaccatca ccatcactat tatcattctg gccatttgtt 
3001 ggtcggttac tcggtttcag taatgggaca gccaaacctt ttaaattaaa ac 

L2 ANSWER 100 OF 104 GENBANK. RTM . COPYRIGHT 2004 on STN 



LOCUS (loc): 

GenBank ACC. no. (GBN) 

GenBank version (ver) 

CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF) : 

KEYWORDS (ST): 
SOURCE: 
ORGANISM (ORGN): 



BC021657 GenBank (R) 
BC021657 

BC021657.1 Gl: 18203773 
386533-01-3 
6420 

mRNA ; linear 
Rodents 
3 Oct 2003 

Mus musculus ankyrin 3, epithelial, mRNA (cDNA clone 
MGC: 14049 IMAGE : 4188590) , complete cds. 
MGC 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Rodentia; 
Sciurognathi ; Muridae; Murinae; Mus 
NUCLEIC ACID COUNT (NA) : 1884 a 1623 c 1620 g 1293 t 
COMMENT : 

Contact: MGC help desk 

Emai 1 : cgapbs - rOmai 1 . ni h . gov 

Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: institute for Systems Biology 

http://www.systemsbiology.org 

contact: amadan@systemsbiology.org 

Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha 
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle whiting 
clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 18 Row: k Column: 4 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 25121951. 



REFERENCE: 

AUTHOR (AU): 



TITLE (TI): 

JOURNAL (SO) : 

OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO) : 



1 (bases 1 to 6420) 
strausberg,R.L. ; Feingold,E.A. ; Grouse, L.H.; 
Derge.J.G.; Klausner,R.D. ; Collins, F. S. ; Wagner, L.; 
Shenmen,C.M. ; schuler ,G.D. ; Altschul ,S.F. ; Zeeberg,B.; 
Buetow,K.H.; Schaefer,C.F. ; Bhat,N.K.; Hopkins, R. F. ; 
Jordan, H.; Moore, T. ; Max,s.l.; Wang, J.; Hsieh,F.; 
Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 
Hong,L.; Stapleton.M. ; Soares,M.B.; Bonal do ,M . F. ; 
Casavant,T.L. ; Scheetz,T.E. ; Brownstei n ,M . J . ; 
Usdin,T.B.; Toshiyuki ,s. ; carninci,P.; Prange,c.; 
Raha,s.s.; Loquellano,N.A. ; Peters, G. J.; Abramson , R.D. ; 
Mullahy,S.J.; Bosak,S.A.; McEwan,P.J.; McKernan , K . J . ; 
Malek,J.A.; Gunaratne,p.H. ; Richards, S.; Worley ,K.C. ; 
Hale,S.; Garcia, A.M.; Gay,L.J.; Hulyk,S.W.; 
Villalon,D.K. ; Muzny , D.M. ; Sodergren , E . J . ; Lu,X.; 
Gibbs,R.A.; Fahey, J.; Helton, E.; Ketteman, M.; Madan, A.; 
Rodrigues, s. ; Sanchez, A.; Whiting, M.; Madan, A.; 
Young, A. c; shevchenko,Y. ; Bouffard,G.G. ; 
Blakesley,R.w. ; Touchman , J .w. ; Green, E.D.; 

Dickson, M. C. ; Rodriguez, A. c. ; Grimwood,J.; schmutz,J.; 
Myers, R.M.; Butterfield, Y.s . ; Krzywinski ,M. I. ; 
skalska,u.; Smailus ,D.E. ; Schnerch,A.; Schein,J.E.; 
Jones, S.J. ; Marra,M.A. 

Generation and initial analysis of more than 15,000 

full -length human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 

2 (bases 1 to 6420) 
Strausberq,R. 
Direct Submission 

Submitted (14-JAN-2002) National Institutes of Health, 
Mammalian Gene Collection (MGC), Cancer Genomics 
office, National Cancer Institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT) 
Feature Key 



Location 



Qua! ifier 



source 



1..6420 



/orqanism="Mus musculus" 
/moT-type= ,, mRNA M 
/strain="FVB/N" 
/db-xref="taxon : 10090" 



/cl one="MGC : 14049 IMAGE : 4188590" 
/tissue-type="salivary gland, 10 
week old female mouse" 
/cl one-1 i b="NCI-CGAP-SG2 " 
/lab-host= ,, DH10B" 
/note="vector: pCMV-SPORT6" 
/gene="Ank3" 

/note="synonym: MGC14049" 

/db-xref= ,, LocuslD:11735 n 

/db-xref= M MGI: 88026" 

/codon-start=l 

/product="Ank3 protein" 

/protein-id="AAH21657.1" 

/db-xref="Gl: 18203774" 

/db-xref="LocusID: 11735" 

/transl ation="MSEEPKEKPAKPAHRKRKGK 

KSDANASYLRAARAGHLEKALDYI 

KNGVDVNICNQNGLNALHLASKEGHVEWSELLQ 

REANVDAATKKGNTALHIAS LAGQ 

AEWKVLVTNGANVNAQSQNGFTPLYMAAQENHL 

EWRFLLDNGASQSLATEDGFTPL 

AVA LQQGH DQWS L L L E N DTKGKVR L PA LH I AA R 

KDDTKAAAL L LQN DTNADVE5KSG 

FTPLHIAAHYGNINVATLLLNRAAAVDFTARNDI 

TPLHVASKRGNANMVKLLLDRGAK 

IDAKTRDGLTPLHCGARSGHEQWEMLLDRSAPI 

LSKTKNGLSPLHMATQGDHLNCVQ 

L L LQH N V PVDDVTN DYLTA L H VAAHCGH YKVAKV 

LLDKKASPNAKALNGFTPLHIACK 

KNRIRVMELLLKHGASIQAVTESGLTPIHVAAFM 

GHVNIVSQLMHHGASPNTTNVRGE 

TALHMAARSGQAEWRYLVQDGAQVEAKAKDDQT 

PLHISARLGKADIVQQLLQQGASP 

NAATTSGYTPLH LAAREGH EDVAAFLLDHGAS LS 

ITTKKGFTPLHVAAKYGKLEVASL 

LLQKSASPDAAGKSGLTPLHVAAHYDNQKVALLL 

LDQGAS PHAAAKNGYTPLHIAAKK 

NQMDIATSLLEYGADANAVTRQGIASVHLAAQEG 

HVDMVSLLLSRNANVNLSNKSGLT 

PLHLAAQEDRVNVAEVLVNQGAHVDAQTKMGYTP 

PHVGCHYGNIKIVN FLLQHSAKVN 

AKTKN GYTA LHQAAQQGHTH 1 1 N VL LQN N AS PN E 

LTVNGNTALAIARR LGYISWDTL 

KWTEEIMTTTTITEKHKMNVPETMNEVLDMSDD 

EGEDAITGDTDKYLGPQDLKELGD 

DSLPAEGYVGFSLGARSASLRSFSSDRSYTLNRS 

SYARDSMMIEELLVPSKEQH LTFT 

REFDSDSLRHYSWAADTLDNVNLVSSPVHSGFLV 

S FMVDARGGSMRGS RHHGMRII I P 

PRKCTAPTRITCRLVKRHKLANPPPMVEGEGLAS 

RLVEMGPAGAQFLGPVIVEIPHFG 

SMRGKERELIVLRSENGETWKEHQFDSKNEDLAE 

LLNGMDEELDSPEELGTKRICRII 

TKDFPQYFAWSRIKQESNQIGPEGGILSSTTVP 

LVQASFPEGALTKRIRVGLQAQPV 

PEETVKKILGNKATFSPIVTVEPRRRKFHKPITM 

T I P V P P PS G EGVS N G YKGD AT PN L 

RLLCS ITGGTS PAQWE DITGTT P LT FI KDCVS FT 

TNVSARFWLADCHQVLETVGLASQ 

L YRE L I CV P YMAK FW FAKTN D P VES S L RC FCMT 

DDRVDKTLEQQENFEEVARSKDIE 

VLEGKPIYVDCYGNLAPLTKGGQQLVFNFYSFKE 

NRLPFSIKIRDTSQEPCGRLSFLK 

EPKTTKGLPQTAVCNLNITLPAHKKAEKADRRQS 

FAS LALRKRYS YLTEPSMS PQS PC 

ERTDIRMAIVADHLGLSWTELAREL 

NFSVDEINQIRVENPNSLISQSFMLLKKWVTRD 

GKNATTDALTSVLTKINRIDIVTLLEGPIFDYGN 

ISGTRSFADENNVFHDPVDGWQNE 

TPSGSLESPAQARRLTGGLLDRLDDSSDQARDSI 

TSYLTGEPGKIEANGNHTAEVIPE 

AKAKPYFPESQNDIGKQSIKENLKPKTHGCGRTE 

E PVS P LTA YQKS L E ETS K LVI E DA 

PKPCVPVGMKKMTRTTADGKARLNLQEEEGSTRS 

EPKQGEGYKVKTKKEIRNVEKKTH " 

/note="Arp; Region: FOG: Ankyrin 



misc-feature 3214. .3528 



misc-feature 4672. .4953 



repeat [General function 
prediction only]" 
/db-xref= ,, CDD:coG0666" 
/note="ZU5; Region: ZU5 domain. 
Domain present in ZO-1 and 
Unc5-like netrin receptors Domain 
of unknown function" 
/db-xref="CDD:pfam00791" 
/note="DEATH; Region: DEATH 
domain, found in proteins involved 
in cell death (apoptosis). 
Alpha-helical domain present in a 
variety of proteins with apoptotic 
functions. Some (but not all) of 
these domains form homotypic and 
heterotypic dimers" 
/db-xref="CDD:smart00005" 



SEQUENCE (SEQ): 

1 ccacgcgtcc gggagaagca agccctgaaa tgttgtcaag atcttttctc ggggtaacga 
61 ctggctaaaa aaacaaataa aaagagagag aaaggaagaa aaagaaagaa ggaagaaagg 
121 aaggaagaaa acaaagggtt ggaatattga tttttctttg gaggaataac cagggctttg 
181 ctggagagaa cacccagttc ctccctgagg cgagctgagt tgcccgtgac agcagccagg 
241 gctggcttga agcagccgct ctgcctgggg agttccctaa atggactttg gtctcaatgc 
301 tgtgactctt tgccgtgggt ttggatgtgg gaacacggaa ggtgatctcc tgcctcgtct 
361 caactcccct gatctcaagg aaagatgagt gaagagccaa aggagaagcc cgccaagcct 
421 gctcatagga agaggaaagg aaaaaagtct gatgccaacg caagttactt aagagcagct 
481 cgggcagggc acctggaaaa ggcccttgac tacatcaaaa atggagtgga cgtcaacatc 
541 tgtaaccaga atggattgaa tgcactccat cttgcttcca aagaaggcca tgtggaagtg 
601 gtctctgagc tgctgcagag ggaagccaat gttgatgccg ccacaaagaa aggaaacacg 
661 gccttacaca tcgcatcttt ggctgggcaa gcggaagtgg tcaaggtctt ggttacgaac 
721 ggagcgaatg tcaacgcaca atctcagaat ggcttcacac cattgtatat ggcagcccag 
781 gagaaccacc tggaagtcgt caggtttctt ctggacaatg gcgccagcca aagcctggcc 
841 acagaggacg gcttcacgcc attggccgtg gctctgcaac aaggtcatga ccaagtcgtg 
901 tccctcctgc tcgagaacga cacgaaggga aaagtgcgcc tcccagccct ccacatcgca 
961 gcccggaaag acgacaccaa ggcagcagct ctgctcctgc agaatgacac aaacgcggac 
1021 gtggagtcaa agagtggctt caccccgctc cacatagctg cccactatgg gaacatcaat 
1081 gtggccacgt tgctgttaaa ccgagcggct gctgtggact tcaccgcacg gaatgacatc 
1141 actcccttac acgttgcctc gaagcgagga aatgcaaata tggtgaagct attgctggac 
1201 cggggtgcga agatcgatgc caagaccagg gacggtctga ctccgttgca ctgtggggcg 
1261 agaagtggcc atgagcaggt ggtagagatg ttgcttgaca gatccgcccc catcctttca 
1321 aaaaccaaga atggattgtc gccactgcac atggccacac aaggagacca tttaaactgc 
1381 gtccaactcc tcctccagca caacgtgccc gtggacgacg tcaccaacga ctacctgact 
1441 gccctccatg tggctgccca ctgcggccat tacaaagttg ccaaggttct tttggataag 
1501 aaagctagcc ccaatgccaa agccctgaat ggcttcaccc ctctccatat cgcctgcaaa 
1561 aagaaccgca tccgagtaat ggaactcctt ttgaagcacg gtgcatctat tcaagccgta 
1621 accgagtcgg gccttacccc aatccatgtt gctgccttca tgggacatgt aaatatcgtg 
1681 tcacagctaa tgcatcatgg agcctcccca aacaccacca atgtgagagg agagacggca 
1741 ttgcatatgg cggctcggtc cggacaagca gaagtggtgc ggtatctggt ccaagatggg 
1801 gctcaggtag aagcaaaagc taaggatgac cagactccac tccacatctc agcccgactt 
1861 gggaaagctg acatagtgca acaactgtta cagcaaggag catcccccaa tgcagcaaca 
1921 acttctgggt acacccccct tcaccttgcg gccagagagg ggcatgagga tgtagctgcg 
1981 ttcctcctgg atcatggagc atctttatcc ataacaacaa agaagggatt cacccctctg 
2041 cacgtggcag ccaaatacgg aaagcttgaa gtcgcaagtc tcctgctgca gaagagtgcg 
2101 tctcccgatg ccgcagggaa gagcgggcta actccactgc atgtagcagc gcattacgat 
2161 aatcagaaag tggcccttct gctcttggac cagggagcct caccccacgc agccgcaaag 
2221 aatggctata caccactgca catcgcggcc aagaagaacc agatggacat agccacgtcc 
2281 ctgctggagt acggtgctga tgcaaacgcg gttacccggc aagggattgc gtccgtccat 
2341 cttgcggcac aggaagggca cgtggacatg gtgtcgctgc tcctgagtag aaacgcgaat 
2401 gtcaacctga gcaataagag cggtctcacc ccactccacc tggctgctca agaagaccga 
2461 gtgaatgtgg ccgaggtcct tgtcaaccag ggggcccatg tggatgctca gacaaagatg 
2521 ggctacaccc cgccccatgt gggctgtcac tatggaaata tcaaaatagt caattttctg 
2581 ctgcagcatt ctgcaaaagt taatgccaag acgaagaatg gatacacagc actgcaccag 
2641 gctgctcagc agggccacac gcatatcatc aatgtcttgc ttcagaacaa cgcctccccc 
2701 aatgaactca ctgtgaatgg gaacacagct ctggccatcg cccggcgcct tggttacatc 
2761 tcggtggttg acacactgaa ggtcgtgacg gaggaaatta tgaccaccac taccatcacg 
2821 gagaagcaca aaatgaatgt cccagaaacg atgaatgaag tcctcgatat gtcagacgat 
2881 gaaggtgaag atgccatcac aggggacact gacaagtatc tcgggccaca ggaccttaag 
2941 gagctaggtg atgactccct gccagcagaa ggttacgtag gcttcagtct tggagcccgt 
3001 tctgccagcc tccgctcctt cagttcggat aggtcctaca ccttgaacag aagctcctac 
3061 gcaagggaca gcatgatgat agaggaactt ctggtaccat ccaaagagca gcacctgacg 
3121 ttcacgaggg agtttgattc tgactccctc agacactaca gttgggcagc ggacacgtta 
3181 gataatgtga acctggtctc aagcccggtg cattctgggt ttctggttag ctttatggtg 
3241 gacgcgagag ggggctccat gcgaggaagc cgccaccacg ggatgcggat catcatccct 
3301 ccgcgaaagt gtacggcccc cacccgcatc acgtgccgcc tggtaaagag acataaactg 
3361 gccaacccac cccccatggt ggaaggagag ggattagcca gtaggctggt agaaatgggt 



3421 cctgcggggg cacaattttt aggccccgtc attgtggaaa tccctcattt tgggtccatg 
3481 agggggaagg agagagaact tatcgtcctt cggagcgaga acggagagac ctggaaggaa 
3541 catcagtttg acagtaaaaa cgaagacctc gcggagcttc tcaatggcat ggatgaagaa 
3601 ctcgacagcc cggaagagtt gggtacaaag cgcatctgca gaattatcac aaaggatttc 
3661 ccccagtatt ttgccgtggt ttcccggatt aagcaggaaa gcaaccagat cggtcctgag 
3721 ggtgggattc tgagcagcac caccgtgccc ctcgtccagg cctccttccc agagggcgcc 
3781 ttaaccaaga ggatccgtgt gggtctccag gctcagcccg tgccagagga aacggtaaaa 
3841 aaaatccttg ggaacaaagc aacatttagc ccaattgtca cggtagagcc gaggagaagg 
3901 aagttccata agccgatcac catgaccatt ccggtgcccc cgccctcggg agaaggcgtg 
3961 tccaatgggt acaaggggga tgccacgccc aacctgcggc tcctctgcag catcacagga 
4021 ggcacctcac cagctcaatg ggaagacatc acaggaacaa cccctctgac gttcataaag 
4081 gattgtgtgt ctttcacaac caacgtttca gccagattct ggctggcgga ctgccatcag 
4141 gtgttagaga ccgtagggct agcctcccag ctgtacagag agctgatatg cgttccctac 
4201 atggccaagt tcgttgtgtt tgccaaaaca aacgacccgg tggagtcctc gctgaggtgc 
4261 ttctgtatga cagacgacag ggtggacaaa accctggagc agcaggagaa cttcgaggag 
4321 gttgccagaa gcaaagacat tgaggttctg gaaggaaagc ccatctacgt tgattgctat 
4381 ggaaacctgg cccctctgac caaaggagga cagcagcttg tttttaactt ttattctttc 
4441 aaagaaaaca gactgccatt ttccatcaag atcagagaca ccagtcaaga gccctgtggc 
4501 cgcctgtctt tcctgaagga gccaaagaca acaaagggat taccccaaac agctgtttgc 
4561 aacttaaata ttactctgcc ggcacataaa aaggctgaga aggcagacag acgccagagc 
4621 tttgcctccc tagctttacg taagcgctac agctacttga ctgaacccag catgagtccg 
4681 cagagtcctt gtgagcggac ggatatcagg atggcgatag tagccgatca cctgggactt 
4741 agttggacag agctggcaag ggaactgaat ttttcagtgg atgaaatcaa ccaaatacgt 
4801 gtggaaaatc ccaattcttt aatttctcag agcttcatgt tattaaaaaa gtgggtgacc 
4861 agagacggaa agaatgccac aactgatgcc ttaacttcgg tcttaacgaa gattaaccgg 
4921 atagacattg taactctgct ggaaggacca atatttgatt atgggaatat ttcaggcacc 
4981 agaagctttg cagatgaaaa caatgttttc catgacccag ttgatggttg gcagaacgag 
5041 acgccaagtg gaagcctaga gtccccagcg caagctcgaa gactaactgg tgggttactg 
5101 gaccgtctgg atgacagctc tgaccaggct cgggattcta ttacctcata cctcacggga 
5161 gaacctggga agatcgaagc aaatggaaac cacacagcgg aagtcattcc agaagcaaag 
5221 gcaaaaccct acttcccgga atcccaaaac gatataggga aacagagcat caaggagaac 
5281 ctgaaaccaa aaacacacgg atgtggtcgc actgaggaac cagtgtcgcc cctcacagcc 
5341 taccagaaat ctctggaaga aaccagcaag cttgtcatag aagacgcacc taaaccctgt 
5401 gtgcctgtcg gcatgaaaaa gatgaccagg actacggctg acggcaaagc caggctcaac 
5461 ctccaggaag aagaggggtc caccaggtca gagcctaagc agggagaagg ctataaggtg 
5521 aagacgaaga aggaaatccg gaacgtggag aagaaaaccc actagtgaca gtgacgtcag 
5581 tccaggacca cttggtcata ctgccagtat tgagaaactc accagagatc agcaggaaga 
5641 aaaatagtca ccaagagtgg agtgtgggtc agctgcttcc accacacgtc agtgacacga 
5701 tttttttttt gtaaacgcag aaggaaaacg aaaatcacat tttttctttc atttagcatg 
5761 agccgatcta caaagcatgg aaactcacat tcattcccgg cactgaaatg cacactgagc 
5821 aagtgcagtg tacgacacag gaagccatgt tcttcagacc cggttctcaa gtaatttagt 
5881 gtcattttca aagaagacag acaaaacaac acctgtgcca cccaggagga ggggcagcag 
5941 aggacggaat gatcaaatca agaagacaca ctaaaattac taaattcaca acagctcgcc 
6001 ttatttttct cggacccaaa ctaactgtca gggtataaac actgttcctt tctttctttc 
6061 tttctttctt tctttttttt tttaaagaaa aataataata ataataatga aaaaaatcaa 
6121 tagttttgct gtaaataata acactgtaaa ttctaacaga agaaactaca ataaagaatt 
6181 ataaggcacc cgcctcacag aacacaaaga tatcacaaat gcatccaagg acaggcgtcc 
6241 caagtgactt caccctggac aaggatgggg atccagcgct ctcgggactc tttcactgac 
6301 tcttgtatat tagcaattat gtttgtacac gcaaaactgc tagcttgatt ttaaaagaaa 
6361 aaaaatgcaa atctttaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
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locus (LOC): BC009333 GenBank (R) 

GenBank ACC. NO. (GBN) : BC009333 

GenBank VERSION (VER) : BC009333.2 Gl:40226527 

CAS REGISTRY NO. (RN) : 342081-81-6 

SEQUENCE LENGTH (SQL): 2688 

MOLECULE type (Cl) : mRNA; linear 

DIVISION code (CI): Primates 

DATE (DATE): 25 Mar 2004 

DEFINITION (DEF): Homo sapiens unc-5 homolog A (c. elegans), mRNA (cdna 

clone IMAGE : 4126760) , partial cds . 
SOURCE: Homo sapiens (human) 

ORGANISM (ORGN): Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; 

Euteleostomi ; Mammalia; Eutheria; Primates; Catarrhini; 

Hominidae; Homo 

COMMENT: 

On Dec 19, 2003 this sequence version replaced gi:14424611. 

Contact: MGC help desk 

Email : cgapbs-rOmail .nih.gov 

Tissue Procurement: ATCC 

cdna Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: National Institutes of Health Intramural 



Sequencing Center (NISC) , 
Gaithersburg, Maryland; 
Web site: http://www.nisc.nih.gov/ 
Contact : ni sc_mgc@nhgri . ni h . gov 

Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M. , Benjamin , B. , 
Blakesley.R.W. , Bouffard ,G.G. , Breen,K., Brinkley,c, Brooks,s., 
Dietrich ,N. L. , Granite,s., Guan,x., Gupta, J., Haghighi,P., 
Hansen ,N., ho,s.-l., Karlins,E., Kwong,P., Laric,P., Leqaspi,R., 
Maduro.Q.L., Masie"l"lo,c. , Maskeri,B., Mastri an , s . D . , McCloskey , J . c. , 
McDowell, J., Pearson ,R., Stantripop ,S. , Thomas, P. J., Touchman , J .W. , 
Tsurgeon.c, vogt,J.L., Walker, M. A., Wetherby ,K.D. , Wiggins, L., 
Young, A., Zhang, L.-H. and Green,E.D. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
series: iral Plate: 26 Row: g Column: 22. 



reference: 

AUTHOR (AU): 



TITLE (TI): 

JOURNAL (SO): 

OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



1 (bases 1 to 2688) 

strausberg,R.L. ; Feingold,E.A. ; Grouse, L.H.; 
Derge,J.G.; Klausner ,R.D. ; Collins,F.S.; Wagner, L.; 
shenmen,C.M. ; Schuler ,G.D. ; Altschul ,S.F. ; Zeeberg,B.; 
Buetow,K.H.; Schaefer,C.F. ; Bhat,N.K.; Hopkins, R.F. ; 
Jordan, H.; Moore, T. ; Max,s.i.; Wang, J.; Hsieh.F.; 
Diatchenko,L. ; Marusina,K.; Farmer, A. A.; Rubin, G.M.; 
Hong,L.; Stapleton,M. ; Soares,M.B.; Bonaldo ,M. F. ; 
Casavant,T.L. ; Scheetz ,T. E . ; Brownstein ,M. J . ; 
Usdin.T.B.; Toshiyuki ,s. ; Carninci,P.; Prange,c; 
Raha,S.S.; Loquellano,N.A. ; Peters, G. J.; Abramson , R.D. 
Mul 1 ahy , s . J . ; Bosak,S.A.; McEwan,P.J.; McKernan , K . J . ; 
Malek,J.A.; Gunaratne,P.H. ; Richards, s.; Worley,K.C; 
Hale,s.; Garcia, A.M.; Gay,L.J.; Hulyk,S.W.; 
Villalon,D.K. ; Muzny,D.M. ; Sodergren , E . J . ; Lu,X.; 
Gibbs,R.A.; Fahey,J.; Helton, E.; Ketteman,M.; Madan,A. 
Rodrigues,s. ; Sanchez, A.; whiting, M.; Madan.A.; 
Young, A. C. ; Shevchenko,Y. ; Bouffard, G.G. ; 
Blakesley,R.W. ; Touchman, J .W. ; Green, E.D.; 
Dickson, M. c. ; Rodriguez, A. C. ; Grimwood,J.; schmutz,J.; 
Myers, R.M.; Butterfield , Y.s . ; Krzywinski ,M.I. ; 
skalska,u.; smailus,D.E. ; Schnerch,A.; schein,J.E.; 
Jones, s. J.; Marra,M.A. 

Generation and initial analysis of more than 15,000 

full-length human and mouse cDNA sequences 

Proc. Natl. Acad. Sci . U.S.A., 99 (26), 16899-16903 

(2002) 

CA 138:84325 

2 (bases 1 to 2688) 
Strausberg, R. 
Direct Submission 

Submitted (12-JUN-2001) National Institutes of Health, 
Mammalian Gene Collection (MGC), Cancer Genomics 
office, National Cancer Institute, 31 Center Drive, 
Room 11A03, Bethesda, MD 20892-2590, USA 



FEATURES (FEAT): 
Feature Key 



Location 



Qualifier 



source 



1. .2688 



gene 



CDS 



<1. .2688 



<1. .1627 



/orqanism= u Homo sapiens" 

/mol -type^'mRNA" 

/db-xref= M taxon:9606 n 

/cl one=" IMAGE : 4126760" 

/ti ssue-type="Brai n , 

neuroblastoma" 

/cl one-1 i b="NIH-MGC-19" 

/lab-host="DHl0B-R" 

/note="vector: pOTB7" 

/gene="UNC5A" 

/note="synonym: UNC5H1" 

/db-xref="LocusID: 90249" 

/db-xref="MIM:607869" 

/gene="UNC5A" 

/codon-start=2 

/product="UNC5A protein" 

/protein-id="AAH09333.2" 

/db-xref="GI:40226528" 

/db-xref="LocuslD: 90249" 

/db-xref="MIM: 607869" 

/transl ati on="DVALYVGLIAVAVCLVLLLL 

VLILVYCRKKEGLDSDVADSSILT 



misc-feature 413. .724 



misc-feature 1343. .1594 



SGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQG 

S LCPRQDGPS PK FQLTNGH L LS P L 

GGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPR 

GTSNMTYGTFNFLGGRLMIPNTGI 

SLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGC 

QTLLSPIVSCGPPGVLLTRPVILA 

MDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEE 

APSHLYYCQLEASACYVFTEQLGR 

FALVGEALSVAAAKRLKLLLFAPVACTSLEYNIR 

VYCLHDTHDALKEWQLEKQLGGQ 

LIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLL 

VSYQEIPFYHIWNGTQRYLHCTFT 

LERVS PSTSDLACKLWVWQVEGDGQS FS IN FNIT 

KDTRFAELLALESEAGVPALVGPS 

AFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLH 

LDSHLSFFASKPSPTAMILNLWEA 

RH F PNGN L S Q L AAAV AG LGQ P DAG L FT VS EAEC" 

/gene="UNC5A" 

/note="ZU5; Region: Domain present 
in ZO-1 and Unc5-like netrin 
receptors" 

/db-xref= ,, CDD:smart00218" 
/gene="UNC5A n 

/ not e=" DEATH ; Region: DEATH 
domain, found in proteins involved 
in cell death (apoptosis). 
Alpha-helical domain present in a 
variety of proteins with apoptotic 
functions. Some (but not all) of 
these domains form homotypic and 
heterotypic dimers 11 
/db-xref= n CDD:smart00005 n 



SEQUENCE (SEQ): 

1 ggacgtggcc ctctatgtgg gcctcatcgc cgtggccgtc tgcctggtcc tgctgctgct 

61 tgtcctcatc ctcgtttatt gccggaagaa ggaggggctg gactcagatg tggctgactc 

121 gtccattctc acctcaggct tccagcccgt cagcatcaag cccagcaaag cagacaaccc 

181 ccatctgctc accatccagc cggacctcag caccaccacc accacctacc agggcagtct 

241 ctgtccccgg caggatgggc ccagccccaa gttccagctc accaatgggc acctgctcag 

301 ccccctgggt ggcggccgcc acacactgca ccacagctct cccacctctg aggccgagga 

361 gttcgtctcc cgcctctcca cccagaacta cttccgctcc ctgccccgag gcaccagcaa 

421 catgacctat gggaccttca acttcctcgg gggccggctg atgatcccta atacaggaat 

481 cagcctcctc atccccccag atgccatacc ccgagggaag atctatgaga tctacctcac 

541 gctgcacaag ccggaagacg tgaggttgcc cctagctggc tgtcagaccc tgctgagtcc 

601 catcgttagc tgtggacccc ctggcgtcct gctcacccgg ccagtcatcc tggctatgga 

661 ccactgtggg gagcccagcc ctgacagctg gagcctgcgc ctcaaaaagc agtcgtgcga 

721 gggcagctgg gaggatgtgc tgcacctggg cgaggaggcg ccctcccacc tctactactg 

781 ccagctggag gccagtgcct gctacgtctt caccgagcag ctgggccgct ttgccctggt 

841 gggagaggcc ctcagcgtgg ctgccgccaa gcgcctcaag ctgcttctgt ttgcgccggt 

901 ggcctgcacc tccctcgagt acaacatccg ggtctactgc ctgcatgaca cccacgatgc 

961 actcaaggag gtggtgcagc tggagaagca gctgggggga cagctgatcc aggagccacg 

1021 ggtcctgcac ttcaaggaca gttaccacaa cctgcgccta tccatccacg atgtgcccag 

1081 ctccctgtgg aagagtaagc tccttgtcag ctaccaggag atcccctttt atcacatctg 

1141 gaatggcacg cagcggtact tgcactgcac cttcaccctg gagcgtgtca gccccagcac 

1201 tagtgacctg gcctgcaagc tgtgggtgtg gcaggtggag ggcgacgggc agagcttcag 

1261 catcaacttc aacatcacca aggacacaag gtttgctgag ctgctggctc tggagagtga 

1321 agcgggggtc ccagccctgg tgggccccag tgccttcaag atccccttcc tcattcggca 

1381 gaagataatt tccagcctgg acccaccctg taggcggggt gccgactggc ggactctggc 

1441 ccagaaactc cacctggaca gccatctcag cttctttgcc tccaagccca gccccacagc 

1501 catgatcctc aacctgtggg aggcgcggca cttccccaac ggcaacctca gccagctggc 

1561 tgcagcagtg gctggactgg gccagccaga cgctggcctc ttcacagtgt cggaggctga 

1621 gtgctgaggc cggccaggcc cgacacctac actctcacca gctttggcac ccaccaagga 

1681 caggcagaag ccggacaggg gcccttcccc acaccgggga gagctgctcg gacaggcccc 

1741 ctcccggccg aagctgtccc ttaatgctgg tccttcagac cctgcccgaa ctcccacctc 

1801 tccatggcct gcctagccag gctggcactg ccactcacac tcggccccag ggcccaggag 

1861 ggacagtgcc tggagcctgg gccaggccca gcccatctgt gtgtgtgtat gtgcgtgtga 

1921 tgctacctct cctcccgtcc ctctccaggg gccccgcata cacacggcca tgcacgcaca 

1981 cactgggcct gggccagggc cccagagctc ctgcctgagc tggaccttat gcaaacattt 

2041 ctgtgcctgc tgggtagggg cacgtctgag gggccctgct ccaagcctgc aggaccgagg 

2101 gccacagccg gacagggggt agcccctgga ttcaggcaca cgaccaccac acgagcacgt 

2161 gccacgcatg cctcgtgtgc tcatctcaca cacacccccc tcccgggtca cgcagacacc 

2221 ccccaaccac acacatctca tgccgtacac ctgaggctgc tcacgtctca cgcccagtgt 

2281 tggtgcacat ttgcctctca catgctgccc tctccaccca cccagggaca ccccacggct 

2341 cctccctgcc cctgcccctc ccccagcctt gaggtgccct gcccggcggg gcctgtgaat 

2401 atgcaatggg agtcccaggc tgtacagtgg tgagtgtgtg tgtggcgtgg cgtgcccgtc 

2461 cccagggctg gctggtgccc cacgcggggc ctgtcatgtg aagctcgtgt cctgactttg 



2521 tcttaagtgc attcacgcac ttactcttgg ccttatgtac acagccttgc ccggccgccg 
2581 gggcacatag gggttttatc gggcgtgaat gtaaataaat tatatatata tattgaaaaa 
2641 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 
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LOCUS (LOC) : 
GenBank ACC. no. (GBN) 
GenBank version (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE) : 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



GenBank (R) 
Gi: 14337476 



BG938104 
BG938104 
BG938104.1 
340854-24-2 
460 

mRNA ; linear 
Expressed sequence tag 
11 Jun 2001 

lAbollD02 Bovine Abomasum cDNA Library Bos taurus cDNA 
5 ' , mRNA sequence. 

COW. 

Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi ; Mammalia; Eutheria; Cetartiodactyla; 
Ruminantia; Pecora; Bovoidea; Bovidae; Bovinae; Bos 
NUCLEIC ACID COUNT (NA) : 129 a 84 c 123 g 124 t 
COMMENT: 

Contact: Dr. Stephen Moore 
. Beef Genomics Laboratory 
Dept of AFNS, University of Alberta 

410 Agri/For, Dept of AFNS, U of A, Edmonton, AB, T6G 2P5, Canada 

Tel : 780 492 0169 

Fax: 780 492 4265 

Email : smoore@afns.ualberta.ca 

The sequence best matches gb:HS34B21 (Human DNA sequence from clone 
34B21 on chromosome 6pl2. 1-21.1. Contains part of a gene for a 
novel protein with ZU5 domain similar to part of Tight Junction 
Protein 201 (TJPl) and ***unc5*** Homologs, the gene for a novel BZR 

alue 

of le-40 

PCR PRimers 

FORWARD: M13 Forward 

BACKWARD: Ml3 Reverse 

seq primer: T3 primer 

High quality sequence stop: 460. 
reference: 1 (bases 1 to 460) 

AUTHOR (AU): Moore, S.S.; Hansen.O; Li,C; Fu,A.; Meng.Y.; Li , G . 

TITLE (TI) : cDNA's from bovine abomasum tissue 

JOURNAL (SO): Unpublished (2001) 



FEATURES (FEAT); 
Feature Key 



Location 



Qualifier 



source 1..460 /organism="Bos taurus" 

/db-xref="taxon:9913" 
/clone-lib="Bovine Abomasum cDNA 
Library" 

/sex="Two males and one female 
mixed" 

/tissue-type="Gastrointestinal 

tissue (GIT)" 

/eel 1 -type=" Epi thel i al 11 

/dev-stage=" Young adult" 

/l ab-host="XLl-BlueMRF ' -st rai n" 

/note="0rgan: Abomasum; Vector: 

uni-2ZAPXR; site-1: EcoR I; 

Site-2: Xho I" 

SEQUENCE (SEQ): 

1 gagatgggcg atatatatat tacctgatta caaagaaaag ggcttcacac aagecaaett 
61 atgaaaactt aeggaagagt ttagaggcta tgaagtccca ttgtctgaag aatggagtca 
121 ccgacctttc catgecaagg atcggatgtg gtcttgatcg tetgeaatgg gaaaatgtat 
181 ccgcaataat tgaggaggtc tttgaggcaa cagacatcag aattactgtg tacacactct 
241 gaacagaaga gcattttgga tgtgtctacc ttctgeatea cctgggccac aggactgagc 
301 aaactgacct taaaatggtc agaggaccgt ttcttgtgaa gtagtctgtg ttacaggeca 
361 gacctgggtt gctgtattct caggattgaa tgggactctg gaggaggggt ttcttgttgc 
421 taggaatttt tctttttcag gaaggtaagg ggagggacca 
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LOCUS (LOC): 

GenBank ACC. no. (gbn) 
GenBank VERSION (VER) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI): 
DATE (DATE): 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN): 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 



JOURNAL (SO) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI): 
JOURNAL (SO): 



UNC5 



( 



AF129475 GenBank (R) 
AF129475 

AF129475.1 Gl: 6002701 
243886-90-0 
600 

mRNA; linear 
Other vertebrates 
1 Oct 1999 

Petromyzon marinus netrin receptor 
***UNC5*** ) mRNA, partial cds. 
sea lamprey. 
Petromyzon marinus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Hyperoartia; Pet romyzonti formes ; Petromyzontidae; 
Petromyzon 

: 156 a 150 c 171 g 123 t 

1 (bases 1 to 600) 

Shi f man 5 M.I. ; Selzer,M.E. 

Expression of the netrin receptor UNC-5 selectively in 
poorly regenerating neurons following spinal 
transection in lamprey 

Neurorehabil . Neural Repair (1999) in press 

2 (bases 1 to 600) 

Shi f man, M.I. ; Selzer,M.E. 
Direct submission 

submitted (18-FEB-1999) Neurology, University of 
Pennsylvania, 452 Stemmler Hall, 36th Street and 
Hamilton Walk, Philadelphia, PA 19104, USA 



FEATURES (FEAT) : 
Feature Key 



Location 
.600 



<1. .>600 
<1. ->600 



Qualifier 



source 



gene 
CDS 



/organi sm=" Petromyzon mari nus" 

/db-xref= ,, taxon:7757 ,, 

/tissue-type="brain" 

/gene="UNC5" 

/gene="UNC5" 

/codon-start=3 

/product="netrin receptor UNC5 ,! 

/protei n-i d= n AAF00103 . 1" 

/db-xref="Gl: 6002702" 

/transl ation="HREEQARYIVKNKPVTMSCA 

ASPATQIYFKCNGEWLHQKAHHIE 

EREDETTGRSVREVQTDVSRQQVEELFGLEDYWC 

QCVAWS AAGTS KS RKAYVR LAYL R 

KNFEQKPLGKYALLDHEVLLHCRPPDAIPQAEVE 

WLKSEEIIDPVIDQNFYITVDHNL 

IIKQTRLADSANYTCVAKNLVAKRRSSTATITVY 

VNGGW" 



SEQUENCE (SEQ) : 

1 tacatcgtga agaacaagcc cgttacatcg 

61 ccgcctcgcc cgccacccag atctacttca 

121 cccatcacat tgaggagagg gaggatgaga 

181 cagacgtgtc tcggcagcag gtggaggagc 

241 gcgtcgcctg gagtgcagcc ggcaccagca 

301 atttgcggaa gaattttgag caaaagccat 

361 ttctgttgca ttgtcgccct cctgatgcca 

421 gtgaagaaat tattgaccca gtcatagatc 

481 taataattaa gcagactcgg ctggcagaca 

541 tggtggccaa gcggcggagc tctactgcca 



tgaagaataa gccggtcacc 
agtgcaacgg ggagtggctc 
caacagggcg gtcagtacgg 
tgtttgggct ggaggactac 
agagccgcaa ggcttatgtc 
tgggcaaata tgctcttctt 
ttccccaagc cgaggtggag 
aaaacttcta catcacggta 
gtgctaacta cacgtgcgtt 
cgatcaccgt gtatgtcaac 



atgagctgcg 
caccagaagg 
gaggttcaga 
tggtgtcaat 
cgcttagcat 
gaccatgaag 
tggttgaaaa 
gatcacaacc 
gccaagaatc 
ggcggctgga 
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LOCUS (LOC): 
GenBank acc. no. (gbn) 
GenBank version (ver) : 
CAS registry no. (rn): 

SEQUENCE LENGTH (SQL): 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE): 
DEFINITION (DEF): 

SOURCE: 
ORGANISM (ORGN) : 



GenBank (R) 



CEU62639 
U62639 
U62639.1 GI :4097486 
224328-12-5 
3935 

DNA; linear 
Invertebrates 
26 Jan 1999 

Caenorhabditis elegans lipoprotein receptor precursor 
(Ir) gene, complete cds. 
Caenorhabditis elegans. 
Caenorhabditis elegans 



NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU): 

TITLE (TI): 

JOURNAL (SO) : 

reference: 

AUTHOR (AU): 
TITLE (TI): 
JOURNAL (SO): 



Eukaryota; Metazoa; Nematoda; chromadorea; Rhabditida; 
Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis 
: 1100 a 830 c 841 g 1164 t 

1 (bases 1 to 3935) 
Tang , P . ; Ki ngston , I . B . 

Genomic organization of the Caenorhabditis elegans 

lipoprotein receptor (Ir) gene 

Unpublished 

2 (bases 1 to 3935) 
Tang , P . ; Ki ngston , I . B . 
Direct submission 

submitted (01-JUL-1996) Department of Parasitology, 
Chang Gung College of Medicine and Technology, 259 
Wen-nwa 1st Road, Kweishan, Taoyuan 333, Taiwan 



FEATURES (FEAT) : 

Feature Key Location 



Qualifier 



source 



1. .3935 



gene 

CAAT-signal 
CAAT-signal 

sig-peptide 

mRNA 



CDS 



203. .3934 
203. .207 
616. .620 

join(841. .897,946. .966) 
join(<841. .897, 
946.-1497,1544. .2393, 
2439. .2629,2680. .2888, 
2933..3934) 
join(841. .897, 
946. .1497,1544. .2393, 
2439. .2629,2680. .2888, 
2933. .3851) 



/organism="Caenorhabditis elegans" 

/strain="Bristol N2 n 

/db-xref="taxon:6239" 

/chromosome="IV" 

/map="between unc5 and rtw7" 

/clone="#CBl007" 

/note="see GenBank Accession 

Number U62281 for cDNA" 

/gene="lr" 

/gene="lr" 

/gene="lr" 

/note= f 'inverted" 

/gene= n lr M 

/gene="lr" 



/gene="lr" 



/note="similar to low density 

lipoprotein receptor" 

/codon-start=l 

/protei n-i d="AAD09364 . 1" 

/db-xref="GI:4097487" 

/trans 1 ation="MRTMRLAWLLPLFIHILIKN 

TAQAPAVNNSTCDQAKEFDCGNGR 

LRCIPAEWQCDNVADCDKGRDESGCSYAHHCSTS 

FMLCKNGLCVANEFKCDGEDDCRD 

GSDEQHCEYNILKSRFDGSNPSAPTTFVGHNGPE 

CHPPRLRCRSGQCIQPDLVCDGHQ 

DCSGGDDEVNCTRRGHENMQSSTDFHDDVHLVDP 

TFFANEDNKCRSGYTMCHSGDVCI 

PDSFLCDGDLDCDDASDEKNCQTNAPSEEEYLSG 

QADHMHSCSAAGMYSCGTKGSEIG 

VCIPMNATCNGIKECPLGDDESKHCSECARKRCD 

HTCMNTPHGARCICQEGYKLADDG 

LTCEDEDECATHGH LCQH FCEDRLGS FACKCANG 

YELETDGHSCKYEATTTPEGYLFI 

SLGGEVRQMPLADFTDGSNYSAIQKFAGHGTIRS 

IDFMHRNNKMFMSISDEHGDPTGE 

LSVSDNGLMRVLRENVIGVSNVAVDWIGGNVFFT 

QKSPSPSAGISICTMSGMFCRRVI 

EGKEQGQS YRGLWH PMRGLI IWIDS YQKYH RIM 

MANMDGSQVRI LLDNKLEVPSALA 

IDYIRHDVYFGDVERQLIERVNIDTKERRWISN 

GVHHPYDMAYFNGFLYWADWGSES 

LKVQEMTHHHSSPQVIHTFNRYPYGIAVNHSLYQ 

TGPPSNPCLELECPWLCVIVPKSD 

FIMTAKCVCPDGYTHSVTENSCIPPVTIEDEENL 

EKLSHIGSALMAEYCEAGVACMNG 

GACRELQNEHGRAHRIVCDCEGPYDGQYCERLNP 

EKFSAMEEEDSSLWLIVLLLIFLI 

I VAWGI I AFLWFSQQEHMKDVI STARVRVDNMA 

RKAEDAAAPIVEKFRKVTDKQRST 

PPREGCQTATNVDFVSYETNAEKRIRMDSSPTSY 

GNPMYDEVPESSTGFVRSASAPFA 



misc-feature 



misc-feature 



mat-peptide 



misc-feature 



misc-feature 



misc-feature 



misc-feature 



misc-feature 



misc-feature 



misc-feature 



join (967. .1497, 
1544. .1837) 



967. .1095 



join(967..1497, 

1544. .2393,2439. .2629, 

2680. .2888,2933. .3848) 

1096. .1215 



misc-feature 1285. .1401 



misc-feature join(1483. .1497, 
1544. .1651) 



join(1495. .1497, 
1544. .1555) 



1700. .1837 



misc-feature 1838. .1948 



join(1838. .2393, 
2439.-2629,2680. .2888, 
2933. .3431) 



1949. .2071 



misc-feature 2006. .2023 



join(2072. .2393, 
2439. .2629,2680. .2888, 
2933. .3251) 

3057. .3191 



misc-feature 3252 . . 3383 

misc-feature 3432. .3506 

misc-feature 3507 . . 3848 

misc-feature 3747. .3758 



pol yA-si gnal 3915 . . 3920 
polyA-site 3934 



GVIRFENDSLL" 
/gene="lr" 

/note="encodes ligand binding 

domain" 

/gene= ,, lr M 

/note="encodes class A ligand 

binding repeat" 

/gene=Hr" 

/product="l i poprotei n receptor" 
/gene="lr" 

/note="encodes class A ligand 
binding repeat" 
/gene=H r" 

/note="encodes class A ligand 
binding repeat" 
/gene=*l r" 

/note="encodes class A ligand 

binding repeat" 

/gene=Hr" 

/note="encodes class A ligand 

binding repeat" 

/gene=*lr" 

/note="encodes class A ligand 

binding repeat" 

/gene=*lr" 

/note="encodes class B.2 EGF-like 

repeat" 

/gene="lr" 



/note="encodes EGF precursor 
homology domain" 
/gene=*lr" 

/note="encodes class B.2 EGF-like 

repeat" 

/gene="lr" 

/note="encodes trans-splicing 

acceptor site" 

/gene="lr" 



/note="encodes YWTD spacer region" 
/gene="lr" 

/note="encodes class B.2 EGF-like 

repeat" 

/gene="lr" 

/note="encodes class B.l EGF-like 

repeat" 

/gene="l r" 

/note="encodes transmembrane 

domain" 

/gene="lr" 

/note="encodes cytoplasmic domain" 

/gene="lr" 

/note="encodes NPMY 

internalization signal" 

/gene="lr" 

/gene="lr" 



SEQUENCE (SEQ): 
1 aaaaatgtat 
61 ccgtaatttg 
121 cgtttccatc 
181 cctgtttttc 
241 aacaactgta 
301 gaggttccac 
361 cagatgctac 
421 caatgaatgg 
481 ttaaaaacaa 
541 ttgaagattt 
601 atctttaaaa 



gtctgatttt 
tatttttctg 
gtgtattttg 
ctcgaaaaga 
ataaactgaa 
aactttaaac 
actttctgcc 
cagatacatc 
attgcacaac 
cgaattaatt 
gatttattgg 



gaaatgctca 
aagatgagca 
ttaaaatatt 
ttccaatgtt 
tatattatct 
accaacaacg 
tgaaaaaaat 
aaagttttag 
gattctacta 
tgttcaatat 
aaaagataaa 



tttcctttga 
attcaatttt 
cacagattaa 
ctataattct 
atcgcatcgt 
caatcctaaa 
tctgaaaagc 
atgaacaatt 
ctgtcgcact 
tgtgttaaaa 
tggttaattt 



ggtttccatt 
taaattgccc 
cccatttacc 
acaaaacttc 
tttcaaccag 
tcatttgcaa 
cgaacaataa 
tttatgtatt 
aattttacgt 
tgtttgattt 
aaaccaaaaa 



tttgagttgc 
gcacctctac 
gtttcatcca 
ccacgcgaga 
aattaagcaa 
gattttattt 
ttcatggtaa 
aaatgtacat 
atgtctgtac 
atacactcaa 
tttccatcaa 



661 gccttttctg aaaacactaa aattattttc gtggtgggac caggcgcgcg cgtcccatga 
721 tgttccttta atcaaaatgc atttctgtcc cggcgggaga aattgaattt tgattttaag 
781 gcgcgaattt ttgcctaaaa acgatgccat tctttcattc ttttcataat ctcactcacc 
841 atgagaacca tgcgccttgc ttggttgctc ccacttttta ttcacatact aatcaaggta 
901 atttccccgt ttttctagtt ttttcaatgt attttcatgt ttcagaacac agctcaagct 
961 ccggctgtca acaactcgac atgcgatcaa gcaaaggaat ttgattgcgg gaacgggaga 
1021 ctccgatgca ttcccgcgga gtggcaatgc gacaacgtag cggactgcga caaaggaaga 
1081 gacgaatcgg gctgctcata tgcgcatcat tgttcgacaa gcttcatgtt atgcaagaat 
1141 ggactgtgtg tcgcaaatga gttcaaatgc gacggcgaag acgactgccg cgatggaagc 
1201 gatgagcagc attgcgagta caatatcctg aagtctcgct tcgatggttc caatccttcg 
1261 gctcctacca ctttcgttgg tcacaatggc ccagaatgcc atcctcctcg tttacgatgc 
1321 cgatcaggac aatgtattca accagatctc gtttgtgatg gacatcagga ttgttctgga 
1381 ggagatgatg aggtcaactg caccagaagg ggacatgaaa atatgcagtc ctcgactgat 
1441 tttcacgatg atgttcatct tgtcgatcca acctttttcg ctaatgaaga caataaggta 
1501 attgtttaat gtttattaat ccgttttaac ttttattttt cagtgtcgga gtggatacac 
1561 aatgtgccat agcggagacg tctgcatacc tgacagtttt ctttgtgacg gcgatctaga 
1621 ttgtgatgat gcttcggacg agaaaaactg ccaaactaat gctccaagcg aagaagaata 
1681 tctttctggg caagccgatc acatgcattc gtgctcagca gcaggaatgt attcttgtgg 
1741 aacaaaagga tccgaaattg gcgtttgtat tccgatgaat gccacgtgta atgggatcaa 
1801 ggagtgtcca ctaggagatg acgagtcaaa acattgctcc gaatgtgcca gaaagcgatg 
1861 tgaccacaca tgtatgaaca ctccacacgg ggctcgctgc atttgtcaag aaggatataa 
1921 gcttgccgat gacggactca cttgcgagga tgaagatgag tgtgcaactc atgggcactt 
1981 gtgccagcat ttctgtgaag atcgtttggg ttcctttgca tgcaaatgtg ccaacggtta 
2041 tgagcttgaa acggatgggc attcttgtaa atacgaggca accactacgc cagaaggata 
2101 tttgttcatc agtcttggtg gagaagttcg acagatgcca ttggcagatt tcaccgatgg 
2161 ttcaaattac tcggcgattc aaaagtttgc tggccacgga accatcagat cgatcgactt 
2221 catgcatcgc aacaacaaaa tgttcatgtc aatttctgat gagcacggtg atccaactgg 
2281 cgaattgtca gtgtccgaca atggattgat gagagttctt cgagaaaatg tcattggagt 
2341 gagcaacgtg gcagtcgact ggattggtgg aaacgttttc ttcacacaaa aatgtatgtt 
2401 tatctaatgt ttaaattttt catttgtgat tcttacagct ccatctccaa gcgctgggat 
2461 ttccatctgc acaatgagcg gaatgttctg tcgccgagtt atcgaaggca aagaacaagg 
2521 acaatcctat cgtggtcttg ttgttcaccc gatgcgcggt ctcatcatct ggatcgattc 
2581 ttatcagaaa tatcatcgca tcatgatggc taatatggat gggtctcagg tgagtcgatc 
2641 gagtcgatct gatttagttc atttctaaat aaatttcagg tcagaatcct tctcgacaac 
2701 aagttggaag ttccatcagc tcttgccatc gactacatcc gccacgatgt ctattttgga 
2761 gatgttgaac gtcagttgat cgaaagagtc aatatcgaca cgaaagagcg ccgcgtagtg 
2821 atttcgaacg gagttcatca tccgtatgac atggcttact tcaatggttt cctatactgg 
2881 gcagattggt aagacatctt atctaattta tattttcaaa tttatttttc aggggaagcg 
2941 agtcattaaa ggttcaagag atgacccatc atcattcgag tcctcaagtc atccatactt 
3001 tcaatcgtta tccatatggt attgctgtca atcactcact ctaccagact ggtcctccat 
3061 caaacccatg ccttgaactc gagtgcccat ggctctgcgt tattgtgcca aagagcgatt 
3121 tcattatgac tgccaagtgt gtctgcccag acggatacac tcattccgtc actgaaaact 
3181 cttgcatccc gcctgtgacg attgaggacg aggagaacct tgagaagctt tcccacattg 
3241 gatctgcttt gatggccgaa tactgcgaag ctggtgtcgc gtgtatgaat ggaggagcct 
3301 gccgtgaact acaaaatgag cacggaagag ctcatcgcat cgtttgtgat tgtgagggtc 
3361 catatgacgg gcaatactgc gaacggctca atccagagaa gttctccgca atggaagagg 
3421 aagattcgtc cttatggctt atcgttctgc ttctcatttt tctcatcatc gttgcggtag 
3481 tcggaattat tgccttcctt tggttttctc aacaagagca tatgaaagat gtgatttcca 
3541 ctgcccgtgt ccgtgttgat aacatggcta gaaaagcgga agatgctgca gctccaattg 
3601 tcgagaagtt ccgcaaggtc actgataagc agaggagcac gcctcctaga gaaggttgtc 
3661 aaacggcaac aaacgttgac ttcgtttcct acgagacaaa tgctgagaaa agaattcgga 
3721 tggactcttc gccgacgtca tacggaaacc ccatgtacga tgaagttcct gaatcgtcaa 
3781 ctggtttcgt cagatcggct tccgcaccat tcgctggagt cattcgattt gagaacgaca 
3841 gcttgttgtg aattctacta caaaattact aaatcagatg tctgtaaagt atatctattt 
3901 ttgcctattt attgcatgaa agttgataat gtcta 
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